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CHAPTER1 


Introduction 


The world of modern music and sound production is multifaceted. It's an excit- 
ing world of Creative individuáis: musicians, engineers, producers, manufactur- 
ers and businesspeople who are experts in such fields as music, acoustics, 
electronics, production, broadcast media, multimedia, marketing, graphics, law 
and the day-to-day workings of the business of music. The combined efforts of 
these talented people work together to create a single end product: marketable 
music. The process of turning a Creative spark into a final product takes com- 
mitment, talent, a Creative production team, a marketing strategy and, often, 
money. Over the history of recorded sound, the process of capturing music and 
transforming it into a marketable product has radically changed. In the past, 
the process of turning one's own music into a final product required the use of 
a commercial recording studio, which was (and still is) equipped with special- 
ized equipment and professional staff. With the introduction of the large-scale 
integrated (LSI) Circuit, mass production and mass marketing—three of the 
most powerful forces in the Information Age—another option has arrived on 
the scene: the radical idea that musicians, engineers and/or producers can have 
their own project studio. Along with this concept comes the realization that 
almost anyone can afford, construct and learn to master a personal audio pro¬ 
duction facility. In short, we're living in the midst of a techno-artistic revolution 
that puts more power, artistic control and knowledge directly into the hands of 
Creative individuáis from all walks of life. 

On the techno side, those who are new to the world of modern digital audio 
and multitrack production, musical instrument digital interface (MIDI), mixing, 
remixing and their production environments should be aware that years of 
dedicated practice are often required to develop the skills that are needed to 
successfully master the art and application of these technologies. A person new 
to the recording or project studio environment (Figures 1.1 through 1.3) might 
easily be awestruck by the amount and variety of equipment that's involved in 
the process; however, when we become familiar with the tools, toys and tech- 
niques of recording technology, a definite order to the studio's makeup soon 
begins to appear, with each piece of equipment being designed to play a role 
in the overall scheme of music and audio production. 
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FIGURE 1.1 

The Hit Factory, USA. 
(Courtesy of Solid 
State Logic, www. 
solid-state-logic. 
com.) 



FIGURE 1.2 

Cream Studio, 
London, UK. 
(Courtesy of Solid 
State Logic, www. 
solid-state-logic. 
com.) 



The goal of this book is to serve as a guide and reference tool to help you 
become familiar with the recording and production process. When used in 
conjunction with mentors, lots of hands-on experience, further reading, Web 
searching, soul searching and simple common sense, this book, I hope, will 
help you understand the equipment and day-to-day practices of sound record¬ 
ing and production. 
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FIGURE 1.3 

Patchwork Record- 
ings, Atlanta, GA. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 


C D I Y J 

do it yourself 

Tutorial: Diggin’ 

This book, by ¡ts very nature, ¡s an overview of recording 
technology and production. It’s a very in-depth one, but 
there’s absolutely no way that ¡t can fully devote ¡tself to 
all of the topics. However, we’re lucky enough to have 
the Web at our disposal to help us dig deeper into a 
particular subject that we might not fully understand, or 
simply want to know more about. Giga-tons of sites can 
be found that are dedicated to even the most offbeat 
people, places and things ... and search engines can 
even help you find obscure ¡nformation on how to fix a 
self-sealing stem-bolt on a 1905 steam-driven nut 
cracker. As such, I strongly urge you to use the Web as 
an additional guide. For example, if there’s a subject that 


Deep into the Web 

you just don’t get—look it up on www.wikipedia.org or 
simply “Google” it. Of course, there’s a wealth of info 
that can be found by searching on the innumerable 
www.youtube.com videos that relate to any number of 
hardware Systems, software toys and production 
techniques. Further ¡nformation relating to this book and 
the recording industry at large can also be found at 
www.modrec.com. Digging deeper into the Web will 
certainly provide you with a different viewpoint or another 
type of explanation ... and having that “AH HA!” lightbulb 
go off (as well as the “hokey pokey”) IS definitely what 
it’s all about. 

David Miles Huber (www.modrec.com) 


Although it's taken the modern music studio about 90 years to evolve to its 
current level of technological sophistication, we have moved into an important 
evolutionary phase in the business of music and music production: the digital 
age. Truly, this is an amazing time in production history—we live in a time 
when we can choose between an amazing array of cost-effective and powerful 
tools for fully realizing our Creative and human potential. As always, patience 
and a nose-to-the-grindstone attitude are needed in order to learn how to use 
them effectively; in short, it can free you up for the really important stuff: 
making music and audio productions. In my opinión, these are definitely the 
good ol' days! 
















The Recording Studio 


THE RECORDING STUDIO 

The commercial music studio (Figures 1.4 and 1.5) is made up of one or more 
acoustic spaces that are specially designed and tuned for the purpose of captur- 
ing the best possible sound on a recording médium. In addition, these facilities 


FIGURE 1.4 

Broken Radio Studio, 
San Francisco (built 
in 1969 by Mercury 
Records and 
redesigned and run 
by audio pioneer Bill 
Putnam). (Courtesy of 
Broken Radio, www. 
brokenradio.com; 
photo by Matt 
Boudreau.) 



FIGURE 1.5 

Electronic Arts, 
Vancouver, Cañada. 
(Courtesy of 
Walters-Storyk 
Design Group; 
designed by Beth 
Walters and John 
Storyk, www.wsdg. 
com. Photo by Robert 
Wolsch.) 
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are often structurally isolated in order to keep outside sounds from entering the 
room and being recorded (as well as to keep inside sounds from leaking out 
and disturbing the surrounding neighbors). In effect, the most important char- 
acteristics that go into the making and everyday workings of such a facility 
inelude: 

■ A professional staff 

■ Professional equipment 

■ Professional, yet comfortable working environment 

■ Optimized acoustic and recording environment 

■ Optimized control room mixing environment. 

Recording studio spaces vary in size, shape and acoustic design (Figures 1.6 
and 1.7) and usually reflect the personal taste of the owner or are designed to 
accommodate the music styles and production needs of clients, as shown by 
the following examples: 



FIGURE 1.6 

Floor plan of Sony/ 
Tree’s Music Studio, 
Nashville, TN. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 



FIGURE 1.7 

Floor plan of Paisley 
Park’s Studio A, 
Chanhassen, MN. 
(Courtesy of Paisley 
Park Studios.) 






















































































The Recording Studio 


■ A studio that records a wide variety of music (ranging from classical to 
rock) might have a large main room with smaller, isolated rooms off to 
the side for unusually loud or soft instruments, vocals, etc. 

■ A studio designed for orchestral film scoring might be larger than other 
studio types. Such a studio will often have high ceilings to accommodate 
the large sound buildups that are often generated by a large the number 
of studio musicians. 

■ A studio used to produce audio for video, film dialogue, vocals and 
mixdown might consist of only a single, small recording space off the 
control room for overdub purposes. 

In fact, there is no secret formula for determining the perfect studio design. Each 
studio design has its own sonic character, layout, feel and decor that are based 
on the personal tastes of its owners, the designer (if any), and the going studio 
rates (based on the studio's investment return and the supporting market 
conditions). 

During the 1970s, studios were generally small. Because of the advent of (and 
overreliance on) artificial effects devices, they tended to be acoustically "dead" 
in that the absorptive materials tended to suck the life right out of the room. 
The basic concept was to eliminate as much of the original acoustic environ- 
ment as possible and replace it with artificial ambience. 

Fortunately, since the mid-1980s, many commercial studios that have the phy- 
sical space have begun to move back to the original design concepts of the 1930s 
and 1940s, when studios were larger. This increase in size (along with the addi- 
tion of one or more smaller iso-booths or rooms) has revived the art of captur- 
ing the room's original acoustic ambience along with the actual sound pickup 
(Figure 1.8). In fact, through impro ved studio design techniques, we have 


FIGURE 1.8 

Skywalker Sound 
scoring stage with 
orchestra, Marín 
County, CA. (Courtesy 
of Skywalker Sound, 
www.skysound.com.) 






Introduction 


CHAPTER1 


learned how to achieve the benefits of both earlier and modern-day recording 
eras by building a room that absorbs sound in a controlled manner (thereby 
reducing unwanted leakage from an instrument to other mies in the room) 
while dispersing reflections in a way that allows the room to retain a well- 
developed reverberant and sonic personality of its own. This effect of combining 
direct and natural room acoustics is often used as a tool for "livening up" an 
instrument (when recorded at a distance), a technique that has become popular 
when recording live rock drums, string sections, electric guitars, choirs, etc. 

THE CONTROL ROOM 

A recording studio's control room (Figures 1.9 through 1.11) serves a number of 
purposes in the recording process. Ideally, the control room is acoustically iso- 
lated from the sounds that are produced in the studio, as well as from the sur- 
rounding, outer areas. It is optimized to act as a critical listening environment 
that uses carefully placed and balanced monitor speakers. This room also houses 
the majority of the studio's recording, control and effects-related equipment. At 
the heart of the control room is the recording consolé. 

The recording consolé (also referred to as the board or desk) can be thought of as 
an artist's palette for the recording engineer, producer and artist. The consolé 
allows the engineer to combine, control and distribute the input and output 
signáis of most, if not all, of the devices found in the control room. The con- 
sole's basic function is to allow for any combination of mixing (variable control 
over relative amplitude and signal blending between channels), spatial posi- 
tioning (left/right or surround-sound control over front, center, rear and sub), 
routing (the ability to send any input signal from a source to a destination) and 
switching for the multitude of audio input/output signáis that are commonly 
encountered in an audio production facility. 



FIGURE 1.9 

George Martin at Air 
Lyndhurst, London. 
(Courtesy of Solid 
State Logic, www. 
solid-state-logic. 
com.) 



The Control Room 


FIGURE 1.10 

Sweetwater Control 
Room, Ft. Wayne, IN. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 



FIGURE 1.11 

Matt Boudreau in his 
Broken Radio control 
room, San Francisco. 
(Courtesy of Broken 
Radio, www. 
brokenradio.com; 
photo by Robert 
Cross.) 



Tape machines might be located towards the rear of a control room, while 
digital audio workstations (DAWs) are often located at the side of the consolé 
or at the functional center if the DAW serves as the room's main recording/ 
mixing device. Because of the added noise and heat generated by recorders, 
computers, power supplies, amplifiers and other devices, it's becoming more 
common for equipment to be housed in an isolated machine room that has a 
window and door adjoining the control room for easy access and visibility. In 
either case, remote-control autolocator devices (which are used for locating tape 
and media position cue points) and DAW controller surfaces (which are used 
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for computer-based remóte control and mixing functions) are often situated in 
the control room, near the engineer, for easy access to all recording, mixing and 
transport functions. Effects devices (used to electronically alter and/or augment 
the character of a sound) and other signal processors are also often placed 
nearby for easy accessibility; current designs (often being designed into an 
effects island or bay that is located directly behind the consolé). In certain 
situations, a facility might not have a large recording space at all but simply 
have a small or midsized iso-room for recording overdubs (this is often the 
case for rooms that are used in audio-for-visual post-production and/or music 
remixing). 

As with recording studio designs, every control room will usually have its own 
unique sound, feel, comfort factor and studio booking rate. Commercial control 
rooms often vary in design and amenities—from a room that's basic in form 
and function to one that is lavishly outfitted with the best toys and fully stocked 
kitchens in the business. Again, the style and layout are a matter of personal 
choice; however, as you'll see throughout this book, there are numerous guide- 
lines that can help make the most of a recording space. In addition to the layout, 
feel and equipment, it's important to remember that the people (the staff, musi- 
cians and you)—not just the equipment—will often play a prominent role in 
capturing the feel of a performance. 

THE CHANGING FACES OF THE MUSIC 
STUDIO BUSINESS 

As we've noted, the role of the professional recording studio has begun to 
change as a result of upsurges in project studios, audio for video and/or film, 
multimedia and Internet audio. These market forces have made it necessary for 
certain facilities to rethink their operational business strategies. Often, these 
changes have met with some degree of success, as is illustrated by the following 
examples: 

■ Personal production and home project studios have greatly reduced the 
need for an artist or producer to have constant and costly access to a pro¬ 
fessional facility. As a result, many pro studios now cater to artists and 
project studio owners who might have an occasional need for a larger 
space or better equipped recording facility (e.g., for string overdubs or an 
orchestral session). In addition, after an important project has been com- 
pleted in a prívate studio, a professional facility might be needed to mix 
the production down into its final form. Most business-sawy studios are 
only too happy to capitalize on these new and constantly changing market 
demands. 

■ Upsurges in the need for audio for video and film postproduction have 
created new markets that allow professional recording studios to provide 
Services to the local, national and international broadcast and visual pro¬ 
duction communities. Creative studios often enter into lasting relation- 
ships with audio-for-visual and broadcast production markets, so as to 
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thrive in the tough business of music, when music production alone 
might not provide enough income to keep a studio afloat. 

■ Studios are also taking advantage of Internet audio distribution tech- 
niques by offering Web development, distribution and other Services as 
an incentive to their clients. 

■ A number of studios are also jumping directly into the business of music 
by offering advisory, business and networking Services to artists and bands 
... sometimes signing the artists and funding tours in exchange for a piece 
of the business pie. 

These and other aggressive marketing strategies (many of which may be unique 
to a particular area) are being widely adopted by commercial music and record- 
ing facilities to meet the changing market demands of new and changing media. 
No longer can a studio afford to place all of its eggs in one media basket. 
Tapping into changes in market forces and meeting them with new Solutions is 
an important factor for making it (or simply keeping afloat) in the business of 
music production and distribution. Let's take a moment to say that all-impor- 
tant word again: business. Make no mistake about it—starting, staffing and 
maintaining a production facility, as well as getting the clients' music heard, is 
serious work that requires dedication, stamina, innovation, guts and a definite 
dose of craziness. 

THE PROJECT STUDIO 

With the advent of affordable digital audio and analog recording systems, it's 
a foregone conclusión that the vast majority of music and audio recording/ 
production systems are being built and designed for personal use. The rise of 
the project studio (Figures 1.12 and 1.13) has brought about monumental 


FIGURE 1.12 

Tony Sheppard’s 
studio. (Courtesy of 
Tony Sheppard, 
www.tonysound.com. 
Photo by Ed Colver.) 
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FIGURE 1.13 

785 Records & 
Publishing/Denise 
Rich Songs, New 
York. (Courtesy of 
Solid State Logic, 
www.solid-state- 
logic.com.) 


changes in the business of music and professional audio, in a way that has 
affected and altered almost every facet of the audio production community. 

One of the greatest benefits of a project or portable production system centers 
around the idea that an artist can select from a wide range of tools and toys to 
generate specific sounds—or to get the particular sounds that he or she likes. 
This technology is often extremely powerful, as the components combine to 
create a vast palette of sounds and handle a wide range of task-specific functions. 
Such a system might inelude one or more keyboard synthesizers, synth modules, 
samplers, drum machines, a Computer (with a digital audio workstation and 
sequencing package), effeets devices and audio mixing capabilities. 

Systems like these are constantly being installed in the homes of working and 
aspiring musicians. Their sizes range from a córner in an artist's bedroom to a 
larger system that has been installed within a dedicated project studio. All of 
these system types can be designed to handle a wide range of applications and 
have the important advantage of letting the artist produce his or her music in 
a comfortable, cost-effective, at-home environment whenever the Creative mood 
hits. Such production luxuries, which would have literally cost a fortune 20 years 
ago, are now within the reach of almost every working and aspiring musician. 
This revolution has been carried out under the motto "You don't have to have 
a million-dollar studio to make good music." Literally, the modern-day project 
and portable studio systems offer such a degree of cost-effective power and 
audio fidelity that they can often match the production quality of a professional 
recording facility ... all it takes is knowledge, care, dedication and patience. 

THE PORTABLE STUDIO 

Of course, as laptops have grown in power, it has become a simple matter to 
load them with your favorite DAW software and audio interface, grab your 














Knowledge is Pouier! 


FIGURE 1.14 

Studio on the go. ... 
(Courtesy of M-Audio, 
a división of Avid, 
www.m-audio.com.) 



favorite mies and headphones, put the entire system in your backpack and hit 
the road running. Such a system is literally powerful enough to let you compose, 
produce and mix in the studio ... or on the beach of a remóte seaside island 
under battery power (Figure 1.14). 

Studio in the palm of your hand 

To take these ever-shrinking analogies to the nth degree, newer handheld record- 
ing systems that can literally fit in your pocket can be used to sample and record 
sounds with professional results, using either their internal high-quality mies 
or, in some cases, external professional mies under phantom power. Truly, it is 
a small world after all (Figure 1.15)! 

KNOWLEDGE IS POWER! 

In addition to personal drive and passion, one of the most crucial aspeets 
of putting together a high-quality, cost-effective project studio is definitely 
knowledge. This all-important ingredient can be gained by: 

■ Reading about the equipment choices that are available to you on the Web 
or in the trade magazines 

■ Visiting and talking with others of like (and dissimilar) minds about their 
equipment and personal working styles 

■ Enrolling in a recording course that best fits your needs, working style 
and budget 

■ Getting your hands on equipment before you make your final purchases 
(for example, checking out the tools and toys at your favorite music store) 

■ Experience ... always the best teacher. 
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FIGURE 1.15 

Studio in the wild. 


The more you take the time to familiarize yourself with the options and pos- 
sibilities that are available to you, the less likely you are to be unhappy with 
the way you'll spent your hard-earned bucks. It is also important to point out 
that having the right equipment for the job isn't always enough—obviously, it's 
important to take the time to learn how to use your tools to their fullest poten- 
tial. Whenever possible, read the manual and get your feet wet by taking the 
various settings, functions and options for a test spin ... well befo re you're under 
the time and emotional constraints of being in a session. 

WHATEVER WORKS FOR YOU 

As you begin to research the various types of recording and supporting systems 
that can be put to use in a project studio, you'll find that a wide variety of 
options are available. There are literally hundreds, if not thousands, of choices 
for recording media, hardware types, software systems, speakers, effects devices 
... the list goes on. This should automatically tell us that no "one" tool is right 
for the job. As with everything in art (even the business of an art), there are 
many personal choices that can combine into the making of a working environ- 
ment that's right for you. Whether you: 

■ Work with a hard-disk or tape-based system 

■ Choose to use analog or digital (or both) 

■ Are a Mac or PC kind of person (practically a nonissue these days) 

■ Use this type of software or that ... 

it all comes down to the bottom line of how does it sound? How does it 
move the audience? How can it be sold? In truth, no prospective buyer will 
turn down a song because it wasn't recorded on such-n-such a machine using 
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speakers made by so-n-so—it's the feel, the emotions and the art that seáis 
the deai. 


MAKING THE PROJECT STUDIO PAY FOR ITSELF 

Beyond the obvious advantage of being able to record when, where and how 
you want to in your own project studio, there are several additional benefits to 
working in a personal environment. Here's a sample of how a project studio 
can help to subsidize itself, at any number of levels: 

■ Setting your own schedule and saving money while you're at it! An obvious 
advantage of a project studio revolves around the idea that you can create 
your own music on your own schedule. The expense incurred in using a 
professional studio requires that you be practiced and ready to roll on a 
specific date or range of days. Having your own project studio can free 
you up to lay down practice tracks and/or record when the mood hits, 
without having to worry about punching the studio's time dock. 

■ For those who are in the business of music, media production or the 
related arts or for those who wish to use their creativity as an outlet for 
their personal business, it's possible to write off the equipment, building 
and utility payments as a tax-deductible expense (see Appendix B, "Tax 
Tips for Musicians"). 

■ An individual artist or group might consider preproducing a project in 
their own studio. These time and expense billings might also be tax- 
deductible expenses. 

■ The same artists might consider recording part or all of their production 
at their own project studio. The money saved (and deducted) could be 
spent on a better mixdown facility, production, legal issues (such as 
copyright) and/or marketing. 

■ The "signed artist/superstar approach" refers to the mega-artist who, 
instead of blowing the advance royalties on lavish parties in the studio (a 
sure way never to see any money from your hard work), will spend the 
bucks on building their own professional-grade project studio (Figure 
1.16). After the project has been recorded, the artist will still have a tax- 
deductible facility that can be operated as a business enterprise. When the 
next project comes along, the artist will still have a personal facility in 
which to record and can put the advance bucks in the bank. 

LIVE/ON-LOCATION RECORDING: 

A DIFFERENT ANIMAL 

llnlike the traditional multitrack recording environment, where overdubs are 
often used to build up a song over time, live/on-location recordings are created on 
the spot, in real time, often during a single on-stage or in-the-studio perfor¬ 
mance, with little or no studio postproduction other than mixdown. A live 
recording might be very simple, possibly being recorded using only a few mies 
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FIGURE 1.16 

Whitney Houston’s 
home studio: (a) 
control room; (b) 
recording studio. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 


that are mixed directly to two or more tracks, or a more elabórate gig might cali 
for a full-fledged multitrack setup, requiring the use of a temporary control 
room or fully equipped mobile recording van or truck (Figure 1.17) that can 
record to hard disk and/or tape. A more involved setup will obviously require 
a great deal of preparation and expertise, including a combined knowledge of 
the sound reinforcement and live recording techniques that are necessary to 
capture instruments in a manner that has enough isolation between the tracks 
so as to yield the highest degree of control over the individual instruments 
during the mixdown phase, yet still have a live performance feel. 
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FIGURE 1.17 

Sheffield mobile 
recording truck: (a) 
control room; (b) the 
truck. (Courtesy of 
Sheffield Audio-Video 
Productions, www. 
sheffieldav.com.) 



Although the equipment and system setups will be familiar to any studio engi- 
neer, live recording differs from its more controlled studio counterpart in that 
it exists in a world where the motto is "you only get one chance." When you're 
recording an event where the artist is spilling his or her guts to hundreds or 
even thousands of fans, it's critical for everything to run smoothly. Live record¬ 
ing usually requires a unique degree of preparedness, redundancy, system setup 
skills, patience, and, above all, experience. 
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AUDIO FOR VIDEO AND FILM 

In recent decades, audio has become a much more important and integral part 
of video, film and broadcast production. Prior to the advent of multichannel 
televisión sound (MTS), the DVD, Blu-ray and surround sound, broadcast audio 
was almost an afterthought. With the introduction of these new technologies, 
audio has matured to being a highly respected part of film and video media 
production (Figure 1.18). With the common use of surround sound in the 
creation of movie sound tracks, along with the growing popularity of surround- 
sound in-home entertainment Systems (and an ever-growing number of play- 
back systems for sound, visual media and Computer media), the public has 
come to expect higher levels of audio quality. In this day and age, MIDI, hard- 
disk recording, timecode, automated mixdowns and advanced processing have 
become everyday components of the audio environment, requiring that pro- 
fessionals be highly specialized and skilled in order to meet the demanding 
schedules and production complexities. 

MULTIMEDIA 

With the integration of text, graphics, MIDI, digital audio and digitized video 
into almost every facet of the personal Computer environment, the field of 
multimedia audio has become a fast-growing, established industry that repre- 
sents an important and lucrative source of income for both Creative individuáis 
and production facilities alike. 

In addition to the use of audio in multimedia producís for the Web, education, 
business and entertainment, most of the robot-zappin', daredevil-flyin' addicts 
who are reading this book are very aware that one of the largest and most lucra¬ 
tive areas of multimedia audio production is the field of scoring, designing and 



FIGURE 1.18 

Placing effects in an 
audio-for-film 
system. (Courtesy of 
Loud Technologies 
Inc., www.mackie. 
com.) 
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FIGURE 1.19 

Not your typical 
multimedia studio. 
Electronic Arts, 
Vancouver, Cañada. 
(Courtesy of 
Walters-Storyk 
Design Group. 
Designed by Beth 
Walters and John 
Storyk, www.wsdg. 
com. Photo by Robert 
Wolsch.) 



producing audio for Computer games (Figure 1.19)—Zaaaaaaappppppppp! 
Like most subcategories within audio production, this field of expertise has its 
own set of technical and scheduling requirements that can be further researched 
by reading books on the subject and through educational studies (including the 
ever-present school of hard knocks). 

THE PEOPLE WHO MAKE IT ALL HAPPEN 

"One of the most satisfying things about being in the professional audio 

[and music] industry is the sense that you are part of a community.” 

Frank Wells, editor, Pro Sound Neius 

When you get right down to the important stuff, the recording field is built 
around pools of talented individuáis and Service industries who work together 
for a common goal: producing, selling and enjoying music. As such, it's the 
people in the recording industry who make the business of music happen. 
Recording studios and other businesses in the industry aren't only known for 
the equipment that they have but are often judged by the quality, knowledge, 
visión and combined personalities of their staff. The following sections describe 
but a few of the ways in which a person can be involved in this multifaceted 
industry. In reality, the types and descriptions of a job in this techno-artistic 
industry are only limited by the imagination. New ways of expressing a passion 
for music production and sales are being created every day ... and if you see a 
new opportunity, the best way to make it happen is to roll up your sleeves and 
"just do it." 
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The artist 

The strength of a recorded performance begins and ends with the artist. All of 
the technology in the world is of little use without the existence of the central 
ingredients of human creativity, emotion and individual technique. Just as the 
overall sonic quality of a recording is no better than its weakest link, it is the 
performer's job to see that the ingredient of all music—its inner soul—is laid 
out for all to experience and hear. After all is said and done, a carefully planned 
and well-produced recording project is simply a gilded framework for the 
music's original drive, intention and emotion. 

Studio musicians and arrangers 

A project often requires additional musicians to add extra spice and depth to 
the artist's recorded performance. For example: 

■ An entire group of studio musicians might be called on to provide the 
best possible musical support for a high-profile artist or vocalist. 

■ A project might require musical ensembles (such as a choir, string section 
or background vocals) for a particular part or to give a piece a fuller 
sound. 

■ If a large ensemble is required, it might be necessary to cali in a profes- 
sional music contractor to coordínate all of the musicians and make the 
financial arrangements. The project might also require a music arranger, 
who can nótate and possibly conduct the various musical parís. 

■ A member of a group might not be available or be up to the overall 
musical standards that are required by a project. In such situations, it's 
not uncommon for a professional studio musician to be called in. 

The producer 

Beyond the scheduling and budgetary aspects of coordinating a recording 
project, it is the job of a producer to help the artist and record company create 
the best possible recorded performance and final product that reflects the artist's 
visión. A producer can be hired for a project to fulfill a number of possible 
duties or might even be given complete control over aspects of a project's art- 
istry, finances and program contení. More likely, however, a producer will act 
collaboratively with an artist or group to guide them through the recording 
process to get the best possible final product. This type of producer will often: 

■ Assist in the selection of songs. 

■ Help to focus the artistic goals and performance in a way that best conveys 
the music to the targeted audience. 

■ Help to transíate that performance into a final, salable product (with the 
technical and artistic help of an engineer and mastering engineer). 

A producer can also be chosen for his or her ability to understand the process 
of creating a final recorded project from several perspectives: business, business 
connections (often, an extremely important ingredient), musical input, Creative 


o The People Who Make It All Happen 


insight and mastery of the recording process. Because engineers spend much of 
their working time with musicians and industry professionals with the intention 
of making their clients sound good, it's not uncommon for an engineer to take 
on the role of producer or coproducer (by default or by mutual agreement). 
Conversely, as producers become increasingly more knowledgeable about 
recording technology, it's increasingly common to find them on the other side 
of the glass, sitting behind the Controls of a consolé. 

The engineer 

The job of an engineer can best be described as an interpreter in a techno- 
artistic field. He or she must be able to express the artist's music and the pro¬ 
ducéis concepts and intent through the médium of recording technology. This 
job is actually best classified as an art form, because both music and recording 
are totally subjective in nature and rely on the tastes and experiences of those 
involved. During a recording session, one or more engineers can be used on a 
project to: 

■ Conceptualize the best technological approach for capturing a perfor¬ 
mance or music experience. 

■ Document the process for other engineers or future production use. 

■ Place the musicians in the desired studio positions. 

■ Choose and place the microphones. 

■ Set levels and balances on the recording consolé or DAW mixing 
interface. 

■ Record the performance onto hard disk or tape. 

■ Overdub additional musical parts into the session that might be needed 
at a later time. 

■ Mix the project into a final master recording in any number of media and 
mono, stereo, and surround-sound formats. 

■ Help in meeting the needs for archiving and/or storing the project. 

In short, engineers use their talent and artful knowledge of recording media 
technology to convey the best possible finished sound for the intended media, 
the client and the buying public. 

Assistant engineer 

Many studios often train future engineers (or build up a low-wage staff) by 
allowing them to work as assistants who can offer help to staff and visiting 
freelance engineers. The assistant engineer might do microphone and head- 
phone setups, run tape machines or the DAW setup, help with session docu- 
mentation, do session breakdowns and (in certain cases) perform rough mixes 
and balance settings for the engineer on the consolé. With the proliferation of 
freelance engineers (engineers who are not employed by the studio but are 
retained by the artist, producer or record company to work on a particular 
project), the role of the assistant engineer has become even more important. 
It's often his or her role to guide freelance engineers through the technical 
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aspects and quirks that are peculiar to the studio ... and to generally babysit 
the place. 

Traditionally, this has been a no- or low-wage job that can expose a "newbie" 
to a wide range of experiences and situations. With hard work and luck, many 
assistants have worked their way into the hot seat whenever an engineer quits 
or is unexpectedly ill. As in life, there are no guarantees in this position—you 
just never know what surprises are waiting around the next córner. 

Maintenance engineer 

The maintenance engineer's job is to see that the equipment in the studio is 
maintained in top condition and regularly aligned and repaired when necessary. 
Of course, with the proliferation of project studios, cheaper mass-produced 
equipment, shrinking project budgets and smaller staffs, most studios will not 
have a maintenance engineer on staff. Larger organizations (those with more 
than one studio) might employ a full-time staff maintenance engineer, whereas 
freelance maintenance engineers and technical Service companies are often 
called in to Service smaller commercial studios in both major and nonmajor 
markets. 

Mastering engineer 

Often a final master recording will need to be tweaked in terms of level, equal- 
ization (EQ) and dynamics so as to present the final "master" recording in the 
best possible sonic and marketable light. If the project calis for it, this job will 
fall to a mastering engineer, whose job it is to listen to and process the record¬ 
ing in a specialized, fine-tuned monitoring environment. In fact, mastering is 
also a techno-artistic field. Beauty is definitely in the ear of the beholding client 
and one mastering engineer might easily have a completely different approach 
to the sound and overall feel to a project than the next bloke. However, make 
no mistake about it—the mastering of a project can have a profound impact on 
the final sound of a project, and the task of finding the right mastering engineer 
for the job should never be taken lightly. Further info on mastering can be found 
in Chapter 18. 

The DJ 

Let's not forget one of the more important links for getting the musical word 
out to the buying public: the disc jockey (DJ). Actually, the role of disc jockey 
can take on many modern-day forms: 

■ On the air: The DJ of the airwaves is still a very powerful voice for getting 
the word out about a particular musical product. 

■ On the floor: This DJ form often reinforces the messages from the airwaves 
or helps to promote the musical word in the mainstream and counter- 
cultural music environment. 

■ On the Web: Probably one of the more up-and-coming voices for get¬ 
ting the promotional word out to a large number of specially targeted 
audiences. 


Studio Management 



FIGURE 1.20 

“Mike” the VJ ... 
outstanding in his 
field. 


The VJ 

With the integration of video into the online and 
on-stage scene, the video jockey (VJ) has begun to 
cast a shadow across the virtual canvas of the 
music scene (Figure 1.20). Mixing videos and 
imagery in real time on stage and producing 
music videos for the Web is a relative must for 
many an aspiring artist and band. 

STUDIO MANAGEMENT 

Running a business in the field of music requires 
the special talents of businesspeople who are 
knowledgeable about the inner workings of pro- 
motion, the music studio, the music business 
and, above all, people. It requires constant atten- 
tion to quirky details that would probably be 
totally foreign to someone outside the "biz." 
Studio management tasks inelude: 


■ Management: The studio manager, who might or might not be the owner, 
is responsible for managerial and marketing decisions. 

■ Bookings: This staff person keeps track of most of the details relating to 
studio usage and billing. 

■ Competent secretarial staff: These assistants keep everyone happy and every- 
thing running as smoothly as possible. 


Note, however, that any or all of these functions often vary from studio to 
studio. These and other equally important roles are required in order to suc- 
cessfully opérate a commercial production facility on a day-to-day basis. 


MUSIC LAW 

It's never good for an artist, band or production facility to underestimate the 
importance of a good music lawyer. When entering into important business 
relationships, it's always a good idea to have a professional ally who can help 
you, your band or your company navigate the potentially treacherous waters of 
a poorly or vaguely written contract. Such a professional can serve a wide range 
of purposes, ranging from the primary duties of looking after their clients' 
interests and ensuring that they don't sign their careers away by entering into a 
life of indentured, nonprofit servitude, all the way to introducing an artist to 
the best possible music label. 

Music lawyers, like many in this business, can be involved in the working of a 
business or career in many ways; henee, various fee structures are used. For 
example, a new artist might meet up with a friend who knows about a bright, 
young, freshly graduated music lawyer who has passed the bar exam. By 
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developing a relationship early on, there are any number of potentials for build- 
ing trust, making special deais that are beneficial to both parties, etc. On the 
other hand, a more established lawyer could help solicit and shop a song, band 
or artist more effectively in a major music, TV or film market. As with most 
facets of the biz ... answers to these questions are often situational and require 
intuition, careful reference checking and the building of trust over time. Again, 
it's important to remember that a good music lawyer is often the unsung hero 
of many a successful career. 


WOMEN AND MINORITIES IN THE INDUSTRY 

Ever since its inception, males have dominated the recording 
industry. I remember many a session in which the 


only women on the scene were female artists, 
secretaries or studio groupies. Fortunately, 
over the years, women have begun to play 
a more prominent role, both in front of 
and behind the glass ... and in every facet 
of studio production and the business of 
music as a whole (Figures 1.21). Fortunately, 
since the 1990s, most of the resistance to 
including new and fresh blood into the biz has 
greatly diminished. 


No matter inho you are, uihere 
you’re from or uihat your race, gender, 
sexual or planetary orientation is, 
remember this universal truth: If your heart 
is in it and you’re uiilling to ujork hard 
enough, you’ll make it (ujhatever you 
perceive "it” to be). 

been 


BEHIND THE SCENES 

In addition to the positions listed earlier, there are scores of professionals who 
serve as a backbone for keeping the business of music alive and functioning. 
Without the many different facets of music business—technology, production, 
distribution and law—the biz of music would be very, very different. A small 
sampling of the additional professional fields that help make it happen ineludes: 

Artist management 
Artist booking agents 
A&R (artist and repertoire) 

Manufacturing 
Music and print publishing 

Distribution _ 

Web development 
Graphic arts and layout 
Equipment design 
Audio company marketing 
Studio management 
Five sound tour management 
Acoustics 
Audio instruction 
Club management 
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FIGURE 1.21 

Women’s Audio 
Mission, an organiza¬ 
ron formed to assist 
women in the 
industry. (Courtesy of 
the Women’s Audio 
Mission, www. 
womensaudiomis- 
sion.org.) 
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■ Sound system installation for nightclubs, airports, homes, etc. 

■ ... and a lot more! 

This incomplete listing serves as a reminder that the business of making music 
is full of diverse possibilities and extends far beyond the notion that in order 
to make it in the biz you'll have to sell your soul or be someone you're not. In 
short, there are many paths that can be taken in this techno-artistic business. 
Once you've found the one that best suits your own personal style, you can then 
begin the lifelong task of gaining knowledge and experience and pulling together 
a network of those who are currently working in the field. 


It’s also important to realize that finding the career niche 
that’s right for you might not happen overnight. You 
might try your hand at one aspect of production, only to 
find that your passion is ¡n another field. This isn’t a bad 


thing. As the saying goes, “Wherever you may be, there 
you are!” Finding the path that’s best for you is a lifelong 
ongoing quest ... the general idea is to enjoy the ride. 


CAREER DEVELOPMENT 

It's a sure bet that those who are interested in getting into the business of audio 
will quickly find out that it can be a tough nut to crack. For every person who 
makes it, a large number won't. In short, there are a lot of people who are 
waiting in line to get into what is perceived by many to be a glamorous biz. So, 
how do you get to the front of the line? Well, folks, here's the key: 

■ A ton of self-motivation 

■ Good networking skills 

■ An ongoing and open willingness to learn and experiment. 


The business of art (the techno-art of recording and music being no exception) 
is one that's generally reserved for self-starters. Even if you get a degree from 
XYZ college or recording school, there's absolutely no guarantee that your 
dream studio will be knocking on the door with an offer in hand (in fact, they 
won't). It takes a large dose of perseverance, talent and personality to make it. 

In fact, one of the best ways to get into the biz is to get down on 
you knees, "knight" yourself on the shoulder with a 
sword (figuratively or literally) and say: "I am now 
a_!" Fill in the blank with whatever you 


ANCIENT PROVERB 

Being “in the right place at the right 
time” means being in the wrong (or 
right) place at the wrong time a 
thousand times! 


want to be (engineer, artist, producer, etc.) and 
simply become it ... right there on the spot ... 
Shazammm! Make up a business card, start a 
business and begin contacting artists to work 
with (or make the first step toward becoming 
the Creative person you want to be). There are 
many ways to get to the top of your own personal 
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mountain. For example, you could get a diploma from a school of education 
or from the school of hard knocks (it usually ends up being from both)—but 
the goals and the paths are up to you. Like a mentor of mine always said: 
"Failure isn't a bad thing ... but not trying is!" 

The other half of the success equation lies in your ability to network with other 
people. Like the venerable expression says: "It's not [only] what you know, it's 
who you know." Maybe you have an únele or a friend in the business or a friend 
who has an únele ... you just never know where help might come from next. 
This idea of getting to know someone who knows someone else is what makes 
the business world go around. Don't be afraid to put your best face forward and 
start meeting people. If you want to work at XYZ Studios, hang out without 
being in the way. You never know, the engineer might need some help or might 
know someone who can help get you into the proverbial door. The longer you 
stick with it, the more people you'll meet, thus creating a bigger and stronger 
network than you thought could be possible. 

So, when do you start this grand adventure? When do you start building your 
career? The obvious answer is RIGHTNOW. If you're in school, you have already 
started the process. If you're just hanging out with like-minded biz folks and or 
joined a local or national organization, that, too, is an equally strong start. 
Whatever you do, don't wait until you gradúate or until some magic date in the 
future, because waiting until then will just put you that much further behind. 

Here are a few other tips on how to get started: 

■ Choose a mentor (sometimes they fall out of the sky, sometimes you have 
to develop the relationship). 

■ Contact companies in your area that might be looking for interns. 

■ Use your school counselors for intern placement. 

■ Pick the areas you want to live in (if that is a factor). 

■ Pick the companies and area that you're interested in and target them. 

■ Visit or send resumes or electronic press kits (EPKs) to companies or 
studios that interest you. 

■ Send out lots of resumes. 

■ Make a personal Web page and a MySpace or Facebook page. 

■ Follow up with a phone cali. 

■ Visit these companies just to hang out and see what they are like. 

■ Follow up with another phone cali. 

In short, make yourself visible. Try not to be afraid when sending out a resume, 
demo link or CD of your work or when asking for a job. The worst thing they 
can do is say "No." You might even keep in mind that "No" could actually mean 
"No, not right now." You might ask to see if this is the case. If so, they might 
take your persistence into account befo re saying "No" two or three times. By 
picking a market and particular area, blanketing that area with resumes or EPKs 
and knocking on doors, you just never know what might happen. If nothing 
materializes, just remember the self-motivation factor. I know it's not easy, but 
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pickyourself up (again), reevaluate your strategies and start pounding the streets 
(again). Just remember, "Failing at something isn't a bad thing ... not trying is!" 


THE RECORDING PROCESS 

In this age of pro studios, project studios, digital audio workstations, groove 
tools and personal choices, it's easy to understand how the "different strokes 
for different folks" adage applies to recording in that the differences between 
people and the tools they use allow the process of recording to be approached 
in many different ways. The cost-effective environment of the project studio has 
also brought music and audio production to a much wider audience, thus 
making the process much more personal. If we momentarily set aside the 
monumental process of creating music in its various styles and forms, the 
process of capturing sound onto a recorded médium will generally occur in 
eight distinct steps: 

■ Preparation 

■ Recording 

■ Overdubbing 

■ Mixdown 

■ Mastering 

■ Song sequence editing 

■ Product manufacturing 

■ Marketing and sales. 


Preparation 

One of the most important aspects of the recording process occurs before the 
artist and production team step into the studio: preparation. Questions like the 
following must be addressed long before going into the studio: 

■ What is the goal? 

■ What is the budget? 

■ What are the estimated studio costs? 

■ Will there be enough time to work on vocals, mixing, and other important 
issues before running out of money? 

■ How much will it cost to manufacture the CDs and/or records? 

■ What are the advertising costs? 

■ How will the music be distributed and sold? And to whom? 

■ Is the group practiced enough? 

■ If the project doesn't have a producer, who will speak for the group when 
the going gets rough? 

■ Are the instruments, voices and attitudes ready for the task ahead? 

■ Are there any legal issues to consider? 

■ How and when will the website be up and running? 

These questions and a whole lot more will have to be addressed before it comes 
time to press the big red record button. 
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Recording 

The first phase in multitrack production is the recording process. In this phase, 
one or more sound sources are picked up by a microphone or are recorded 
directly (as often occurs when recording electric or electronic instruments) to 
one or more of the isolated tracks of a recording system. Of course, multitrack 
and hard-disk recording technologies have added an amazing degree of flexibil- 
ity to the process by allowing múltiple sound sources to be captured onto and 
played back from isolated tracks in a disk- or tape-based environment. Because 
the recorded tracks are isolated from each other—with disk-based DAWs offer- 
ing an almost unlimited track count and tape capabilities usually being offered 
in track groups of eight (e.g., 8, 16, 24, 32, 48)—any number of instruments 
can be recorded and rerecorded without affecting other instruments. In addi- 
tion, recorded tracks can be altered, added and edited at any time in order to 
augment the production. 

Beyond the concept of capturing the best performance and sound to a recorded 
media, one of the key ideas within multitrack production is isolation. By record¬ 
ing a single instrument to a dedicated track (or group of tracks), it's possible to 
vary the level, spatial positioning (such as left/right or surround panning), EQ 
and signal processing and routing without affecting the level or tonal qualities 
of other instruments that are recorded onto an adjacent track or tracks (Figure 
1.22). This isolation allows leakage from nearby instruments or mic pickups to 
be reduced to such an insignificant level that individual tracks can be rerecorded 
and/or processed at a later time (during mixdown) without affecting the 
overall mix. 



FIGURE 1.22 

Basic representation 
of how isolated 
sound sources can 
be recorded to a 
DAW or multitrack 
recorder. 
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The basic tracks of a session can, of course, be built up in any number of ways. 
For example, the foundation of a session might be recorded in a traditional 
fashion, involving such acoustic instruments as drums, guitar, piano and a 
scratch vocal (used as a rough guide throughout the session until the final vocals 
can be laid down). Alternatively, these tracks might be made up of basic elec- 
tronic music loops, samples or synth tracks that will need to be transferred to 
tape or imported into a digital audio workstation (DAW). The combinations of 
working styles, studio miking, isolation and instrument arrangements are— 
literally—limitless and, whenever possible, are best discussed or worked out in 
the preparation stages. 

From a technical standpoint, the microphones for each instrument are selected 
either by experience or by experimentation and are then connected to the 
desired consolé or audio interface inputs. Once done, the mic type and track 
selection should be noted on a track sheet or piece of paper for easy input and 
track assignment in the studio or for choosing the same mic during a subse- 
quent overdub session. 

Some engineers find it convenient to standardize on a system that uses the same 
consolé mic input and tape/DAW track number for an instrument type at every 
session. For example, an engineer might consistently plug their favorite kick 
drum mic into input #1 and record it onto track #1, the snare mic onto #2, and 
so on. That way, the engineer instinctively knows which track belongs to a par¬ 
ticular instrument without having to think too much about it. When recording 
to a DAW, track ñames, groupings and favorite identifying track colors can also 
be selected so as to easily identify the instrument or grouped type. 

Once the instruments, baffle (a sound-isolating panel) and mies have been 
roughly placed and headphones that are equipped with enough extra cord to 
allow free movement have been distributed to each player, the engineer can now 
get down to business by using a setup sheet to label each input strip on the 
consolé with the ñame of the corresponding instrument. Label strips (which 
are often provided just below each channel input fader) can be marked with an 
erasable felt marker, or the age-old tactic of rolling out and marking a piece of 
paper masking tape could be used. (Ideally, the masking tape should be the 
type that doesn't leave a tacky residue on the consolé surface.) When recording 
to a DAW, tracks ñames should be typed in at this time, which will be displayed 
on the screen and above the track strips on certain controller readout displays. 

At this time, the mic/line channels can be assigned to their respective tracks. 
Make sure you fully document the assignments and other session info on the 
song or project's track sheet (Figure 1.23). If a DAW is to be used, make 
sure each input track in the session is named for easy reference and track 
identification. 

After all of the assignments and labeling have been completed, the engineer 
then can begin the process of setting levels for each instrument and mic input 
by asking each musician to play solo or by asking for a complete run-through 
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FIGURE 1.23 

Example of a studio 
track log that can be 
used for instrument/ 
track assignments. 
(Courtesy of John 
Vanderslice and Tiny 
Telephone, www. 
johnvanderslice.com.) 


of the song and listening one input at a time (using the solo function). By 
placing each of the channel and master output faders at their unity (0 dB) 
setting and starting with the EQ settings at the fíat position, the engineer 
can then check each of the track meter readings and adjust the mic preamp 
gains to their optimum level while listening for potential preamp overload. If 
necessary, a gain pad can be inserted into the path in order to help eliminate 
distortion. 

After these levels have been set, a rough headphone mix can be made so that 
the musicians can hear themselves. Mic choice and/or placements can be 
changed or EQ settings can be adjusted, if necessary, to obtain the sound the 
producer wants on each instrument, and dynamic limiting or compression can 
be carefully inserted and adjusted for those channels that require dynamic atten- 
tion. It's important to keep in mind that it's easier to change the dynamics of 
a track later during mixdown (particularly if the session is being recorded digi- 
tally) than to undo any changes that have been made during the recording 
phase. 

Once this is done, the engineer and producer can listen for extraneous sounds 
(such as buzzes or hum from guitar amplifiers or squeaks from drum pedáis) 
and eliminate them. Soloing the individual tracks can ease the process of selec- 
tively listening for such unwanted sounds and for getting the best sound from 
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an instrument without any distractions from the other tracks. If several mies are 
to be grouped into one or more tracks, the balance between them should be 
"carefully" set at this time. 

After this procedure has been followed for all the instruments, the musicians 
should do a couple of practice rundown songs so that the engineer and producer 
can listen to how the instruments sound together before being recorded. (If tape 
or disk space is not a major concern, you might consider recording these tracks, 
because they might turn out to be your best takes—you just never know!) 
During the rundown, you might consider soloing the various instruments and 
instrument combinations as a final check and, finally, monitor all of the instru¬ 
ments together. Careful changes in EQ can be made at this time, making sure 
to note these changes in the track sheet or DAW notepad for future reference. 
These changes should be made sparingly, because final compensations are prob- 
ably better made during the final mixdown phase. 

While the song is being run down, the engineer can also make final adjustments 
to the recording levels and the headphone monitor mix. He or she can then 
check the headphone mix either by putting on a pair of phones connected to 
the cue system or by routing the mix to the monitor loudspeakers. If the musi¬ 
cians can't hear themselves properly, the mix should be changed to satisfy their 
monitoring needs (fortunately, this can be done without regard to the recorded 
track levels). If several cue systems are available, múltiple headphone mixes can 
be built up to satisfy those with different balance needs. During a loud session, 
the musicians might ask you to turn up their level (or the overall headphone 
mix), so they can hear themselves above the ambient room leakage. It's impor- 
tant to note that high sound-pressure levels can cause the pitch of instruments 
to sound fíat, so musicians might have trouble tuning or even singing with their 
headphones on. To avoid these problems, tuning shouldn't be done while lis- 
tening through phones. The musicians should play their instruments at levels 
that they're accustomed to and adjust their headphone levels accordingly. For 
example, they might put only one cup over an ear, leaving the other ear free to 
hear the natural room sound. 

The importance of proper headphone levels and a good cue balance can't be 
stressed enough, because they can either help or hinder a musician's overall 
performance. The same situation exists in the control room with respect to high 
monitor-speaker levels: Some instruments might sound out of tune, even when 
they aren't, and ear fatigue can easily impair your ability to properly judge 
sounds and their relative balance. 

During the practice rundown, it's also a good idea to ask the musician(s) to 
play through the entire song so you'll know where the breaks, bridges and any 
other point of particular importance might be. Making notes and even writing 
down or entering the timing numbers (into a DAW as session markers or a 
recorder's transport autolocator) can help speed up the process of finding a 
section during a take or overdub. You can also pinpoint the loud sections at 
this time, so as to avoid any overloads. If compression or limiting is used, you 
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might keep an ear open to ensure that the instruments don't trigger an undue 
amount of gain reduction. (If the tracks are recorded digitally, you might con- 
sider applying gain reduction during mixdown.) Even though an engineer might 
ask each musician to play as loudly as possible, they'll often play even louder 
when performing together. This fact may require further changes in the mic 
preamp gain, recording level and compression/limiting thresholds. Soloing 
each mic and listening for leakage can also help to check for separation and 
leakage between the instruments. If necessary, the relative positions of mies, 
instruments and baffles can be changed at this time. 

Overdubbing 

Once the basic tracks have been laid down, additional instrument and/or vocal 
parts can be added in a process known as overdubbing. During this phase, addi¬ 
tional tracks are added by monitoring the previously recorded tape tracks 
(usually over headphones) while simultaneously recording new, doubled or 
augmented instruments and/or vocals onto one or more available tracks of a 
DAW or recorder (Figure 1.24). During the overdub (OD) phase, individual 
parts are added to an existing project until the song or soundtrack is complete. 
If the artist makes a mistake, no problem! Simply recue the DAW or rewind the 
tape back to the where the instrument begins and repeat the process until you've 
captured the best possible take. If a take goes almost perfectly except for a bad 
line or a few flubbed notes, it's possible to go back and rerecord the offending 
segment onto the same or a different track in a process known as punching in. 
If the musician lays down his or her part properly and the engineer dropped in 
and out of record at the correct times (either manually or under automation), 
the listener won't even know that the part was recorded in múltiple takes ... 
such is the magic of the recording process! 



monitor mix of 
previous tracks 



kick rirum 


snare 


drums L 


drums R 


_ quitar _ 

keyboard 

*** vocal overdub *** 


fead vocal 


FIGURE 1.24 

Overdubbing allows 
instruments and/or 
vocals to be added at 
a later time to 
existing tracks on a 
multitrack recording 
médium. 
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In an overdub session, the same procedure is followed for mic selection, place- 
ment, EQ, and level as they occurred during a recording session (now you're 
beginning to see the need for good documentation). If only one instrument is 
to be overdubbed, the problem of having other instrument tracks leak into the 
new track won't exist. However, care should be taken to ensure that the head- 
phones aren't too loud or improperly placed on the artist's head, because exces- 
sive leakage from the headphone mix can occur. 

The natural ambience of the session should be taken into account during an 
overdub. If the original tracks were made from a natural, roomy ensemble, it 
could be distracting to hear an added track that was obviously laid down in a 
different (usually deader) room environment. 

If the recorder to be used is analog, it should be placed in the master sync mode 
to ensure that the previously recorded tracks will play back in sync from the 
record head (see Chapter 5 for more info). The master sync mode is set either 
at the recorder or using its autolocator/remote control. Usually, the tape machine 
can automatically switch between monitoring the source (signáis being fed to 
the recorder or consolé) and tape/sync (signáis coming from the playback or 
record/sync heads). When recording to a DAW or tape, the control room monitor 
mix should prominently feature the instruments that's being recorded, so mis- 
takes can be easily heard. During the initial rundown, the headphone cue mix 
can be adjusted to fit the musician's personal taste. 

Mixdown 

When all of the tracks of a project have been recorded, assembled and edited, 
the time has come to individually mix the songs into their final media forms 
(Figure 1.25). The mixdown process occurs by routing the various tracks of a 
DAW or tape-based recorder through a hardware or DAW virtual mixing consolé 
to alter the overall session's program with respect to: 


FIGURE 1.25 

Basic representation 
of the mixdown 
process. 
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■ Relative level 

■ Spatial positioning (the physical placement of a sound within a stereo or 
surround field) 

■ Equalization (affecting the relative frequency balance of a track) 

■ Dynamics processing (altering the dynamic range of a track, group or 
output bus to optimize levels or to alter the dynamics of a track so that 
it "fiís" better within a mix) 

■ Effects processing (adding reverb-, delay- or pitch-related effects to a mix 
in order to augment or alter the piece in a way that is natural, unnatural 
or just plain interesting). 

Once ready, the consolé can be placed into the mixdown mode (or each input 
module can be switched to the line or tape position) and the fader label strips 
can be labeled with their respective instrument ñames. Channel and group 
faders should be set to unity gain (0 dB) and the master output faders should 
likewise be set with the monitor section being switched to feed the mixdown 
signal to the appropriate speakers. If a DAW is being used to create the final 
mix, a basic array of effects can be programmed into the session mixer and, if 
available, a controller surface can be used to facilitate the mix by giving you 
hands-on control. Note that when a DAW's own virtual mixer is used, it's pos- 
sible to virtually mix down or "export" the mix to a specified mono, stereo or 
surround soundfile without the need for an external hardware consolé. 

The engineer can then set up a rough mix of the song by adjusting the levels 
and the spatial pan positions. The producer then listens to this mix and might 
ask the engineer to make specific changes. The instruments are often soloed one 
by one or in groups, allowing any necessary EQ changes to be made. The engi¬ 
neer, producer, and possibly the group can then begin the cooperative process 
of "building" the mix into its final form. Compression and limiting can be used 
on individual instruments as required, either to make them sound fuller, more 
consistent in level, or to prevent them from overloading the mix when raised 
to the desired level. At this point, the consoles automation features, if available, 
can be used. Once the mix begins to take shape, reverb and other effects types 
can be added to shape and add ambience in order to give close-miked sounds 
a more "Uve," spacious feeling, as well as to help blend the instruments. 

If the hardware recording consolé isn't automation assisted, the fader settings 
will have to be changed during the mix in real time. This means that the engi¬ 
neer will have to memorize the various fader moves (often noting the transport 
counter to keep track of transition times). If more changes are needed than the 
engineer can handle alone, the assistant, producer or artist (who probably 
knows the transition times better than anyone) can help by controlling certain 
faders or letting you know when a transition is coming up. It's usually best, 
however, if the producer is given as few tasks as possible, so that he or she can 
concéntrate fully on the music rather than the physical mechanics of the mix. 
The engineer then listens to the mix from a technical standpoint to detect any 
sounds or noises that shouldn't be present. If noises are recorded on tracks that 
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aren't used during a section of a song, these tracks can be muted until needed. 
After the engineer practices the song enough to determine and learn all the 
changes, the mix can be recorded and faded at the end. The engineer might not 
want to fade the song during mixdown, because it will usually be performed 
after being transferred to a DAW (which can perform a fade much more 
smoothly than even the smoothest hand). Of course, if automation is available 
or if the mix is performed using a DAW, all of these moves can be performed 
much more easily and with full repeatability by using the software's own mix 
automation. 

It's usually important for levels to be as consistent as possible between the 
various takes and songs, and it's often wise to monitor at consistent, modérate 
listening levels. This is due to the variations in our ear's frequency response at 
different sound-pressure levels, which could result in inconsistencies between 
song balances. Ideally, the control room monitor level should be the same as 
might be heard at home, over the radio or in the car (between 70 and 90 dB 
SPL), although certain music styles will "want" to be listened to at higher levels. 
Once the final mix or completed project master is made, you'll undoubtedly 
want to listen to the mix over different speaker systems (ranging from the small- 
est to the biggest/baddest you can find). It's usually wise to run off a few copies 
for the producer and band members to listen to at home and in their cars. In 
addition, the mix should be tested for mono-stereo/surround compatibility to 
see if any changes in instrumental balances have occurred. If there are any 
changes in frequency balances or if phase becomes a problem when the mix is 
played in mono, the original mix might have to be modified. 

Mastering 

Once done, the final, edited mixdown of a project might be sent to a mastering 
engineer so that fine-tuning adjustments can be made to the overall recording 
with respect to: 

■ Relative level balancing between songs within the project 

■ Dynamic level (altering the dynamics of a song so as to maximize its level 
for the intended media or to tighten up the dynamic balance, overall or 
within certain frequency bands) 

■ Equalization 

■ Overall level. 

In essence, it's the job of a qualified mastering engineer to smooth over any 
level and spectral imbalances within a project and to present the final, recorded 
product in the best possible light for its intended media form. Commonly, the 
producer and/or artist will be faced with the question of whether to hire a 
qualified or well-known mastering engineer to put the finishing touches on a 
master recording, or master the project themselves into a final product (using 
the producer's, engineer's or artist's talents). These questions should be thor- 
oughly discussed in the preplanning phase, allowing for an on-the-spot change 
of plans. Who knows—you just might try mastering the project yourself. If it 
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doesn't work out, you can always work with a trusted mastering engineer to 
present your masterpiece into its best possible form. 

6 Song sequence editing 

With the advent of the DAW, the relatively cumbersome process of sequencing 
music tracks in the analog domain using magnetic tape has given way to the 
faster, easier and more flexible process of editing the final masters from hard 
disk. Using this system, all of the songs can be loaded from their final media 
form to hard disk or exported as individual mixdown files and then loaded into 
a workstation, digital audio editor or CD burning program for assembly into a 
final, edited form. 

Whenever the various soundfiles of a project are loaded into a DAW, audio 
editor or certain CD burning programs, each song or program segment can be 
tailored in a number of ways: 

■ The start and end points can be located and defined for each song. 

■ Each song can then be individually processed using EQ, overall level, 
dynamics, etc. 

■ The proper fades and silence gaps can be entered into the edit session. 

■ The completed master can then be transferred to a CD-R or final mastered 
media. 

Whenever a basic CD burning program is used to create a final master (or the 
datafile for mass producing any number of CD-R dises), each soundfile will 
need to be loaded into the playlist in its final processed and edited form. Most 
of these programs will let you enter gaps of silence (Índex marker gaps) between 
each song in seconds and possibly subdivisions of a second, while leaving a 
2-second gap at the very beginning of the program (which is necessary as part 
of the Red Book CD-Audio standard). 

When sequencing a project, the length of time between the end of one song 
and the beginning of the next can be constant, or the timings can vary accord- 
ing to the musical relationship between the songs. Decreasing the time between 
them can make a song seem to blend into the next (if they're similar in mood) 
or could create a sharp contrast with the preceding song (if the moods are dis¬ 
similar). Longer times between songs help the listeners get out of the mood of 
the previous song and prepare them for hearing something that might be quite 
different. 

It's always a good idea to make at least one master copy of the final mix sound¬ 
files and session data from the original files as a backup (just in case the record 
company, producer or artist wants to make changes at a later date). This simple 
precaution could save you a lot of time and frustration. 

ANALOG SEQUENCE EDITING 

Although the process of assembling a final master in the analog domain occurs 
less frequently than in the digital realm, it's still done. During this process, the 
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engineer edits the original mixes out from their respective reels and begins the 
process of splicing them together into a final sequence on a master reel set. At 
this time, the level test tones (which were laid down at the beginning of the 
mixdown session) should be placed at the beginning of side one. Once this is 
done, the mix master in/out edits should be tightened (to eliminate any noise 
and silence gaps) by listening to the intro and outro at high volume levels, while 
the heads are in contact with the tape (this might require that you place the 
transport into the edit mode). The tape can then be moved back and forth (a 
process known as "jogging" or "rocking" the tape) to the exact point where the 
music begins (intro) and after it ends (outro). Once the in (or out) point is 
positioned over the playback head, the exact position is marked with a grease 
pencil. If there's no noise directly in front of this spot, it's a good practice to 
cut the tape half an inch before the grease pencil mark as a safety precaution 
against editing out part of the first sound. If there is noise ahead of the first 
sound, the tape should be cut at the mark and a leader inserted at that point. 
Paper (rather than plástic) leader tape is used because plástic will often cause 
static electricity pops. 

The tail of the song might need to be monitored at even higher volume levels 
because it's usually a fade-out or an overhang from the last note and is, there- 
fore, much softer than the beginning of the song. The tape is marked and cut 
just after the last sound dies out to eliminate any low-level noises and tape hiss. 
Of course, the length of time between the end of the song and the beginning 
of the next can be constant in a sequenced project, or the timings can vary 
according to the musical relationship between the songs. 

Product manufacturing 

Last but never least in the production chain is the process of manufacturing the 
master recording into a final, salable product. Whether the final product is a 
compact disc or digital download, this process should be carefully overseen to 
ensure that the final product doesn't compromise all of the blood, sweat, tears 
and bucks that have gone into the creation of a project. These manufacturing 
phases should be carefully scrutinized, checked, and rechecked: 

■ Creating a manufacture master 

■ Art layout and printing 

■ Product packaging. 

Whenever possible, ask for a proof copy of the final duplicated product and 
artwork before it is mass produced. Receiving 10,000 copies ofyour hard-earned 
project that aren't what you wanted is as bad as being handed an accordion in 
hell. Further info on this topic can found in Chapter 19. 

8 Marketing and sales 

Although this section is mentioned last, it is by far one of the most important 
areas to be dealt with when contemplating the time, talent and financial effort 
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involved in creating a recorded product. For starters, the following questions 
(and more) should all be answered long before the Record button is pressed 
and the first downbeat is played: 

■ Who is my audience? 

■ Will the project be distributed by a record company, or will I try to sell it 
myself? 

■ What should the final product look and sound like? 

■ What's my budget and how much is this going to cost me? 

In this short section, I won't even attempt to cover this extremely important and 
complex topic, because these subjects have been fully discussed in a number of 
well-crafted books and searchable online articles. 


Because this chapter ¡s an introduction to sound and 
recording technology, l’d like to take a moment to look 
at an important concept that is central to all music, 
sound, electronics and the art of sound recording: the 


transducer. If any conceptual tool can help you to 
understand the technological underpinnings of the art 
and process of recording, this is probably it! 


THE TRANSDUCER 


1 . 26 ). 

A microphone is another example of a trans¬ 
ducer. Here, sound-pressure waves (the médium) 
act on the mic's diaphragm and are converted 
into corresponding electrical voltages. The elec- 
trical signal from the microphone can then be 
amplified (not a process of transduction because 
the médium stays in its electrical form) and fed 
to a recording device. The recorder is a device 
that changes electrical voltages into analogous 
magnetic flux signáis on magnetic tape or into 
representative digital data that can be encoded 
onto tape, hard disk or other type of disc. On 
playback, the stored magnetic signáis or digital 
data are converted back to their original electrical 
form, amplified and then fed to a speaker system. 
The speakers convert the electrical signal back 


FIGURE 1.26 

The guitar and 
microphone as 
transducers. 



Quite simply, a transducer is any device that changes one form of energy into 
another, corresponding form of energy. For example, a guitar is a transducer in 
that it takes the vibrations of picked or strummed strings (the médium), ampli- 
fies them through a body of wood, and converts these vibrations into corre¬ 
sponding sound-pressure waves, which are then perceived as sound (Figure 


The Transducer 


into a mechanical motion (by way of magnetic induction), which, in turn, 
recreates the original air-pressure variations that were picked up by the micro- 
phone ... and ta-da ... we have sound! 

As can be seen from Table 1.1, transducers can be found practically everywhere 
in the audio environment. In general, transducers (and the media they use) are 
often the weakest link in the audio system chain. Given our current technology, 
the process of changing the energy in one médium into a corresponding form 
of energy in another médium can't be accomplished perfectly (although digital 
coding gets very cióse). Noise, distortion and (often) coloration of the sound 
are introduced to some degree and, unfortunately, these effects can only be 
minimized, not eliminated. Differences in design are another major factor that 


Table 1.1 

Media Used by Transducers in the Studio to Transfer Energy 

Transducer 

From 

To 

Ear 

Sound waves in air 

Nerve impulses in the brain 

Microphone 

Sound waves in air 

Electrical signáis in wires 

Record head 

Electrical signáis in wires 

Magnetic flux on tape 

Playback head 

Magnetic flux on tape 

Electrical signáis in wires 

Phonograph 

cartridge 

Grooves cut in disk surface 

Electrical signáis in wires 

Speaker 

Electrical signáis in wires 

Sound waves in air 


FIGURE 1.27 

Example of an 
all-digital production 
studio. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 















Introduction CHAPTER1 O 


can affect sound quality. Even a slight design variation between two micro- 
phones, speaker systems, digital audio converters, guitar pickups or other trans- 
ducers can cause them to sound quite different. This factor, combined with the 
complexity of music and acoustics, helps makes the field of recording the sub- 
jective and personal art form that it is. 

It's interesting to note that fewer transducers are used in an all or largely digital 
recording system (Figure 1.27). In this situation, the acoustic waveforms that 
are picked up by a microphone are converted into electrical signáis and then 
converted into digital form by an analog-to-digital (A/D) converter. The A/D 
converter changes these continuous electrical waveforms into corresponding 
discrete numeric valúes that represent the waveform's instantaneous, analogous 
voltage levels. Arguably, digital information has the distinct advantage over 
analog in that data can be transferred between electrical, magnetic and optical 
media with little or no degradation in quality. Because the information contin¬ 
úes to be stored in its original, discrete binary form, no transduction process is 
involved (i.e., only the médium changes, while the data representing the actual 
information doesn't change). Does this mean that digital's better? It's just 
another way of expressing sound through a médium, which, in the end, is just 
one of the many possible artistic and technological choices in the making and 
recording of sound and music. 
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CHAPTER 2 


Sound and Hearing 


When we make a recording, in effect we're actually capturing and storing sound 
into a memory media so that an original event can be re-created at a later date. 
If we start with the idea that sound is actually a concept that describes the brain's 
perception and interpretation of a physical auditory stimulus, the examination 
of sound can be divided into four areas: 

■ The basics of sound 

■ The characteristics of the ear 

■ How the ear is stimulated by sound 

■ The psychoacoustics of hearing. 

By understanding the physical nature of sound and the basics of how the ears 
change a physical phenomenon into a sensory one, we can discover how to best 
convey this Science into the subjective art forms of music, sound recording and 
production. 

THE BASICS OF SOUND 

Sound arrives at the ear in the form of periodic variations in atmospheric pres- 
sure called sound-pressure tu aves. This is the same atmospheric pressure that's 
measured by the weather Service with a barometer; however, the changes in 
pressure heard by the ear are too small in magnitude and fluctuate too rapidly 
to be observed on a barometer. An analogy of how sound waves travel in air 
can be demonstrated by bursting a balloon in a silent room. Before we stick it 
with a pin, the molecular motion of the room's atmosphere is at a normal 
resting pressure. The pressure inside the balloon is much higher, though, and 
the molecules are compressed much more tightly together—like people packed 
into a crowded subway car (Figure 2.1a). When the balloon is popped ... 
"POW!" (Figure 2.1b), the tightly compressed molecules under high pressure 
begin to exert an outward forcé on their neighbors in an effort to move toward 
areas of lower pressure. When the neighboring set of molecules has been com¬ 
pressed, they will continué to exert an outward forcé on the next set of lower- 
pressured neighbors (Figure 2. le) in an ongoing outward motion that continúes 
until the molecules have used up their energy in the form of heat. 
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(a) (b) (c) 

FIGURE 2.1 

Wave movement in air as it moves away from its point of origin. (a) An intact balloon contains pressurized air. (b) When 
the balloon is popped, the compressed molecules exert a forcé on outer neighbors in an effort to move to areas of 
lower pressure. (c) The outer neighbors then exert a forcé on the next set of molecules in an effort to move to areas of 
lower pressure ... and the process continúes. 


Likewise, as a vibrating mass (such as a guitar string, a person's vocal chords or 
a loudspeaker) moves outward from its normal resting State, it squeezes air 
molecules into a compressed area, away from the sound source. This causes the 
area being acted on to have a greater than normal atmospheric pressure, a 
process called compression (Figure 2.2a). As the vibrating mass moves inward 
from its normal resting State, an area with a lower-than-normal atmospheric 
pressure will be created, in a process called rarefaction (Figure 2.2b). As the 
vibrating body cycles through its inward and outward motions, areas of higher 
and lower compression States are generated. These areas of high pressure will 
cause the wave to move outward from the sound source in the same way waves 
moved outward from the burst balloon. It's interesting (and important) to note 
that the molecules themselves don't move through air at the velocity of sound— 
only the sound wave itself moves through the atmosphere in the form of high- 
pressure compression waves that continué to push against areas of lower pressure 
(in an outward direction). This outward pressure motion is known as wave 
propagation. 

WAVEFORM CHARACTERISTICS 

A waveform is essentially the graphic representation of a sound-pressure level or 
voltage level as it moves through a médium over time. In short, a waveform lets 
us see and explain the actual phenomenon of wave propagation in our physical 
environment and will generally have the following fundamental characteristics: 
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... wave motion 

(a) 



comprcssion mrefaction compression 



FIGURE 2.2 

Effects of a vibrating 
mass on air 
molecules and their 
propagation. (a) 
Compression—air 
molecules are torced 
together to form a 
compression wave. 

(b) Rarefaction—as 
the vibrating mass 
moves inward, an 
area of lower 
atmospheric pressure 
is created. 


■ Amplitude 

■ Frequency 

■ Velocity 

■ Wavelength 

■ Phase 

■ Harmonic contení 

■ Envelope. 

These characteristics allow one waveform to be distinguished from another. The 
most fundamental of these are amplitude and frequency (Figure 2.3). The fol- 
lowing sections describe each of these characteristics. Although several math 
formulas have been included, it is by no means important that you memorize 
or worry about them. It's far more important that you grasp the basic principies 
of acoustics rather than freí over the underlying math. 

Amplitude 

The distance above or below the centerline of a waveform (such as a puré sine 
wave) represents the amplitude level of that signal. The greater the distance or 
displacement from that centerline, the more intense the pressure variation, 
electrical signal level, or physical displacement will be within a médium. Wave¬ 
form amplitudes can be measured in several ways (Figure 2.4). For example, the 
measurement of either the máximum positive or negative signal level of a wave 
is called its peak amplitude valué (or peak valué). The total measurement of 
the positive and negative peak signal levels is called the peak-to-peak valué. The 
root-mean-square (rms) valué was developed to determine a meaningful average 
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FIGURE 2.3 

Amplitude and 
frequency ranges of 
human hearing. 



frequency (Hz) 


◄- one cycle or one period -► 



amplitude. 


level of a waveform over time (one that more closely approximates the level 
that's perceived by our ears and gives a better real-world measurement of overall 
signal amplitudes). The rms valué of a sine wave can be calculated by squaring 
the amplitudes at points along the waveform and then taking the mathematical 
average of the combined results. The math isn't as important as the concept that 
the rms valué of a perfect sine wave is equal to 0.707 times its instantaneous 
peak amplitude level. Because the square of a positive or negative valué is always 
positive, the rms valué will always be positive. The following simple equations 
show the relationship between a waveform's peak and rms valúes: 


amplitude (dB) 
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rms voltage = 0.707 x peak voltage 
peak voltage = 1.414 x rms voltage 


Frequency 

The rate at which an acoustic generator, electrical signal or vibrating mass 
repeats within a cycle of positive and negative amplitude is known as the fre¬ 
quency of that signal. As the rate of repeated vibration increases within a given 
time period, the frequency (and thus the perceived pitch) will likewise increase 
... and vice versa. One completed excursión of a wave (which is plotted over 
the 360° axis of a circle) is known as a cycle (Figure 2.5). The number of cycles 
that occur within a second (frequency) is measured in hertz (Hz). The diagram 
in Figure 2.6 shows the valué of a waveform as starting at zero (0 o ). At time 
t- 0, this valué increases to a positive máximum valué and then decreases back 
through zero, where the process begins all over again in a repetitive fashion. A 
cycle can begin at any angular degree point on the waveform; however, to be 
complete, it must pass through a single 360 ° rotation and end at the same point 
as its starting valué. For example, the waveform that starts at t = 0 and ends at 
t-2 constitutes a cycle, as does the waveform that begins at t = 1 and ends at 
t= 3. 


90 ° 



270 * 


0 a 

360 " 


1B0 S 



FIGURE 2.5 

Cycle divided into the 
360° of a circle. 



FIGURE 2.6 

Graph of waveform 
amplitude over time. 
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Velocity 

The velocity of a sound wave as it travels through air at 68 °F (20 °C) is approx- 
imately 1130 feet per second (ft/sec) or 344 meters per second (m/sec). This 
speed is temperature dependent and increases at a rate of 1.1 ft/sec for each 
Fahrenheit degree increase in temperature (2 ft/sec per Celsius degree). 

Wavelength 

The wavelength of a waveform (frequently represented by the Greek letter lambda, 
X) is the physical distance in a médium between the beginning and the end of 
a cycle. The physical length of a wave can be calculated using: 

X = V/f 

where X is the wavelength in the médium 
V is the velocity in the médium 
/is the frequency (in hertz). 

The time it takes to complete 1 cycle is called the period of the wave. To illustrate, 
a 30-Hz sound wave completes 30 cycles each second or 1 cycle every l/30th 
of a second. The period of the wave is expressed using the Symbol T: 

T = 1// 

where T is the number of seconds per cycle. 

Assuming that sound propagates at the rate of 1130 ft/sec, all that's needed is 
to divide this figure by the desired frequency. For example, the simple math for 
calculating the wavelength of a 30-Hz waveform would be 1130/30 = 37.6 feet 
long, whereas a waveform having a frequency of 300 Hz would be 1130/300 = 
3.76 feet long (Figure 2.7). Likewise, a 1000-Hz waveform would work out as 
being 1130/1000 = 1.13 feet long, and a 10,000-Hz waveform would be 
1130/10,000 = 0.113 feet long. From these calculations, you can see that when- 
ever the frequency is increased, the wavelength decreases. 

REFLECTION OF SOUND 

Much like a light wave, sound reflects off a surface boundary at an angle that is 
equal to (and in an opposite direction of) its initial angle of incidence. This 
basic property is one of the cornerstones of the complex study of acoustics. For 
example, Figure 2.8a shows how a sound wave reflects off a solid smooth surface 
in a simple and straightforward manner (at an equal and opposite angle). Figure 
2.8b shows how a convex surface will splay the sound outward from its surface, 
radiating the sound outward in a wide dispersión pattern. In Figure 2.8c, a 
concave surface is used to focus a sound inward toward a single point, while a 
90° córner (as shown in Figure 2.8d) reflects patterns back at angles that are 
equal to their original incident direction. This holds true both for the 90° 
corners of a wall and for intersections where the wall and floor meet. These 
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(a) 

FIGURE 2.8 


(b) 


FIGURE 2.7 

Wavelengths 
decrease in length as 
frequency increases 
(and vice versa). 




(c) (d) 


Incident sound waves striking surfaces with varying shapes: (a) single-planed, solid, smooth surface; (b) convex 
surface; (c) concave surface; (d) 90° córner reflection. 


córner reflections help to provide insights into how volume levels often build 
up in the corners of a room (particularly at wall-to-floor córner intersections). 

DIFFRACTION OF SOUND 

Sound has the inherent ability to diffract around or through a physical acoustic 
barrier. In other words, sound can bend around an object in a manner that 
reconstructs the signal back to its original form in both frequency and ampli- 
tude. For example, in Figure 2.9a, we can see how a small obstacle will scarcely 
impede a larger acoustic waveform. Figure 2.9b shows how a larger obstacle can 
obstruct a larger portion of the waveform; however, past the obstruction, the 
signal bends around the area in the barrier's wake and begins to reconstruct 
itself. Figure 2.9c shows how the signal is able to radiate through an opening 
in a large barrier. Although the signal is greatly impeded (relative to the size of 
the opening), it nevertheless begins to reconstruct itself in wavelength and 
relative amplitude and begins to radiate outward as though it were a new point 
of origin. Finally, Figure 2.9d shows how a large opening in a barrier lets much 
of the waveform pass through relatively unimpeded. 

FREQUENCY RESPONSE 

The charted output of an audio device is known as its frequency response curve 
(when supplied with a reference input of equal level over the 20- to 20,000-Hz 
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FIGURE 2.9 

The effects of 
obstacles on sound 
radiation and 
diffraction. (a) A small 
obstacle will scarcely 
impede a longer 
wavelength signal. 

(b) A larger obstacle 
will obstruct the 
signal to a greater 
extent; the waveform 
will also reconstruct 
itself in the barrier’s 
wake. (c) A small 
opening in a barrier 
will greatly impede a 
signal; the waveform 
will emanate from 
the opening and 
reconstruct itself as 
a new source point. 
(d) A larger opening 
allows sound to pass 
unimpeded, allowing 
it to quickly diffract 
back into its original 
shape. 






range of human hearing). This curve is used to graphically represent how a 
device will respond to the audio spectrum and, thus, how it will affect a signal's 
overall sound. As an example, Figure 2.10 shows the frequency response of 
several unidentified devices. In these and all cases, the x-axis represents the 
signal's measured frequency, while the y-axis represents the device's measured 
output signal. These curves are created by feeding the input of an acoustic or 
electrical device with a constant-amplitude reference signal that sweeps over the 
entire frequency spectrum. The results are then charted on an amplitude versus 
frequency graph that can be easily read at a glance. If the measured signal is the 
same level at all frequencies, the curve will be drawn as a fíat, straight line from 
left to right (known as a fíat frequency response curve). This indicates that the 
device passes all frequencies equally (with no frequency being emphasized or 
de-emphasized). If the output lowers or increases at certain frequencies, these 
changes will easily show up as dips or peaks in the chart. 

Phase 

Because we know that a cycle can begin at any point on a waveform, it follows 
that whenever two or more waveforms are involved in producing a sound, their 
relative amplitudes can (and often will) be different at any one point in time. 
For simplicity's sake, let's limit our example to two puré tone waveforms (sine 
waves) that have equal amplitudes and frequency ... but start their cyclic periods 
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(a) 

12 dB 

6 dB - 
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0 dB - 

-3 dB - 

-6 dB - 

-12 dB - 

20Hz 

(b) 

12 dB 


FIGURE 2.10 

Frequency response 
curves: (a) curve 
showing a bass 
boost; (b) curve 
showing a boost at 
the upper end; (c) 
curve showing a dip 
in the midrange. 
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3 dB 
O dB 
-3 dB 
-6 dB 
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20Hz 


1kHz 


20kHz 




at different times. Such waveforms are said to be out of phase with respect to 
each other. Variations in phase , which are measured in degrees (°), can be 
described as a time del ay between two or more waveforms. These delays are 
often said to have differences in relative phase degree angles (over the full rota- 
tion of a cycle, e.g., 90°, 180°, or any angle between 0 o and 360°). The sine 
wave (so named because its amplitude follows a trigonometric sine function) 
is usually considered to begin at 0 o with an amplitude of zero; the waveform 
then increases to a positive máximum at 90 °, decreases back to a zero amplitude 
at 180°, increases to a negative máximum valué at 270°, and finally returns 
back to its original level at 360°, simply to begin all over again. 

Whenever two or more waveforms arrive at a single location out of phase, their 
relative signal levels will be added together to create a combined amplitude 
level at that one point in time. Whenever two waveforms having the same fre¬ 
quency, shape and peak amplitude are completely in phase (meaning that they 
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have no relative time difference), the newly combined waveform will have the 
same frequency, phase and shape ... but will be double in amplitude (Figure 
2.1 la). If the same two waves are combined completely out of phase (having a 
phase difference of 180°), they will cancel each other out when added, which 
results in a straight line of zero amplitude (Figure 2.1 Ib). If the second wave is 
only partially out of phase (by a degree other than 180°), the levels will be 
added at points where the combined amplitudes are positive and reduced in 
level where the combined result is negative (Figure 2.11c). 

0 I Y 

do it yourself 


Tutorial: Phase 


1. Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 2—Phase Tutorial” and download the 0 o and 
180° soundfiles. 

2. Load the 0 o file onto track 1 of the digital audio 
workstation (DAW) of your choice, making sure to 
place the file at the beginning of the track, with the 
signal panned center. 

3. Load the same 0 o file again into track 2. 

4. Load the 180° file into track 3. 


5. Play tracks 1 and 2 (by muting track 3) and listen to 
the results. The result should be a summed signal that 
is 3 dB louder. 

6. Play tracks 1 and 3 (by muting track 2) and listen to 
the results. It should cancel, producing no output. 

7. Offsetting track 3 (relative to track 1) should produce 
varying degrees of cancellation. 

8. Feel free to zoom in on the waveforms, mix them 
down, and view the results. Cool, huh? 


FIGURE 2.11 

Combining sine 
waves of various 
phase relationships. 
(a) The amplitudes of 
in-phase waves 
increase in level 
when mixed 
together. (b) Waves 
of equal amplitude 
cancel completely 
when mixed 180° 
out of phase. (c) 
When partial phase 
angles are mixed, the 
combined signáis will 
add in certain places 
and subtract in 
others. 



(a) 
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FIGURE 2.11 

Continued 
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FIGURE 2.12 

Cancellations can 
occur when a single 
source is picked up 
by two microphones. 


PHA5E 5HIFT 

Phase shift is a term that describes one wave- 
form's lead or lag in time with respect to another. 
Basically, it results from a time delay between 
two (or more) waveforms (with differences in 
acoustic distance being the most common source 
of this type of delay). For example, a 500-Hz 
wave completes one cycle every 0.002 sec. Ifyou 
start with two in-phase, 500-Hz waves and delay 
one of them by 0.001 sec (half the wave's period), 
the delayed wave will lag the other by one-half 
a cycle, or 180°. Another example might inelude 
a single source that's being picked up by two microphones that have been placed 
at different distances (Figure 2.12), thereby creating a corresponding time delay 
when the mies are mixed together. Such a delay can also occur when a single 
microphone picks up direct sounds as well as those that are reflected off of a 
nearby boundary. These signáis will be in phase at frequencies where the path- 
length difference is equal to the signal's wavelength, and out of phase at those 
frequencies where the múltiples fall at or near the half-wavelength distance. In 
all the above situations, these boosts and cancellations combine to alter the 
signal's overall frequeney response at the pickup. For this and other reasons, 
acoustic leakage between microphones and reflections from nearby boundaries 
should be kept to a minimum whenever possible. 


Harmonic content 

Up to this point, the discussion has centered on the sine wave, which is com- 
posed of a single frequeney that produces a puré sound at a specific pitch. 
Fortunately, musical instruments rarely produce puré sine waves. If they did, all 
of the instruments would basically sound the same, and music would be pretty 
boring. The factor that helps us differentiate between instrumental "voicings" 
is the presence of frequencies (called partíais ) that exist in addition to the fun¬ 
damental pitch that's being played. Partials that are higher than the fundamen¬ 
tal frequeney are called upper partials or overtones. Overtone frequencies that are 
whole-number múltiples ofthe fundamental frequeney are called harmonics. For 
example, the frequeney that corresponds to concert A is 440 Hz (Figure 2.13a). 
An 880-Hz wave is a harmonic of the 440-Hz fundamental because it is twice 
the frequeney (Figure 2.13b). In this case, the 440-Hz fundamental is technically 
the first harmonic because it is 1 times the fundamental frequeney, and the 
880-Hz wave is called the second harmonic because it is 2 times the fundamen¬ 
tal. The third harmonic would be 3 times 440 Hz, or 1320 Hz (Figure 2.13c). 
Some instruments, such as bells, xylophones and other percussion instruments, 
will often contain overtone partials that aren't harmonically related to the fun¬ 
damental at all. 

The ear perceives frequencies that are whole, doubled múltiples of the funda¬ 
mental as being related in a special way (a phenomenon known as the musical 
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(a) (b) (c) 


FIGURE 2.13 

An illustration of harmonios: (a) first harmonio “fundamental waveform”; (b) second harmonio; (c) third harmonio. 


octave). For example, as concert A is 440 Hz (A3), the ear hears 880 Hz (A4) as 
being the next highest frequency that sounds most like concert A. The next 
related octave above that will be 1760 Hz (A5). Therefore, 880 Hz is said to be 
one octave above 440 Hz, and 1760 Hz is said to be two octaves above 440 Hz, 
etc. Because these frequencies are even múltiples of the fundamental, they're 
known as even harmonios. Not surprisingly, frequencies that are odd múltiples 
of the fundamental are called odd harmonios. In general, even harmonics are 
perceived as creating a sound that is pleasing to the ear, while odd harmonics 
will create a dissonant, harsher tone. 


C D I Y 1 

do it yourself 

Tutorial: 

1. Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 2—Harmonics Tutorial,” and download all of 
the soundfiles. 

2. Load the first-harmonic a440 file onto track 1 of the 
digital audio workstation (DAW) of your choice, making 
sure to place the file at the beginning of the track, with 
the signal panned center. 

3. Load the second-, third-, fourth- and fifth-harmonic 
files into the next set of consecutive tracks. 


Harmonics 

4. Solo the first-harmonic track, then solo the first- and 
second-harmonic tracks. Do they sound related in 
nature? 

5. Solo the first-harmonic track, then solo the first- and 
third-harmonic tracks. Do they sound more dissonant? 

6. Solo the first-, second- and third-harmonic tracks. Do 
they sound related? 

7. Solo the first-, third- and fifth-harmonic tracks. Do 
they sound more dissonant? 
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FIGURE 2.14 

Simple waveforms: 

(a) square waves; 

(b) triangle waves; 

(c) sawtooth waves 



FIGURE 2.15 

Example of a 
complex waveform. 


Because musical instruments produce sound 
waves that contain harmonics with various 
amplitude and phase relationships, the resulting 
waveforms bear little resemblance to the shape 
of the single-frequency sine wave. Therefore, 
musical waveforms can be divided into two cat- 
egories: simple and complex. Square waves, tri¬ 
angle waves and sawtooth waves are examples of 
simple waves that contain a consistent harmonic 
structure (Figure 2.14). They are said to be simple 
because they're continuous and repetitive in 
nature. One cycle of a square wave looks exactly 
like the next, and they are symmetrical about the 
zero line. 

Complex waves, on the other hand, don't neces- 
sarily repeat and often are not symmetrical about 
the zero line. An example of a complex waveform 
(Figure 2.15) is one that's created by any natu- 
rally occurring sound (such as music or speech). 
Although complex waves are rarely repetitive in 
nature, all sounds can be mathematically broken 
down as being an ever-changing combination of 
individual sine waves. 

Regardless of the shape or complexity of a wave¬ 
form that reaches the eardrum, the inner ear is 
able to perceive these component waveforms and 
transmit the stimulus to the brain. This can be 
illustrated by passing a square wave through a 
bandpass filter that's set to pass only a narrow 
band of frequencies at any one time. Doing this 
would show that the square wave is composed of 
a fundamental frequency plus a number of har¬ 
monics that are made up of odd-number múlti¬ 
ple frequencies (whose amplitudes decrease as 
the frequency increases). In Figure 2.16, we see 
how individual sine-wave harmonics can be 
combined to form a square wave. 

If we were to analyze the harmonic content of 
sound waves that are produced by a violin and 
compare them to the content of the waves that 
are produced by a viola (with both playing 
concert A, 440 Hz), we would come up with 
results like those shown in Figure 2.17. Notice 
that the violin's harmonics differ in both degree 
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FIGURE 2.16 

Breaking a square 
wave down into its 
odd-harmonic 
components: 

(a) square wave with 
frequency f\ (b) sine 
wave with frequency 
f\ (c) sum of a sine 
wave with frequency 
f and a lower 
amplitude sine wave 
of frequency 3 f\ 

(d) sum of a sine 
wave of frequency f 
and lower amplitude 
sine waves of 3 f 
and 5 f, which is 
beginning to 
resemble a square 
wave. 



(a) (b) 


FIGURE 2.17 

Harmonio structure of concert A-440: (a) played on a viola; (b) played on a violin. 






















































































































Waveform Characteristics 


and intensity from those of the viola. The harmonics and their relative intensi- 
ties (which determine an instrument's characteristic sound) are called the timbre 
of an instrument. If we changed an instrument's harmonio balance, the sonic 
character of the instrument would also be changed. For example, if the violin's 
upper harmonics were reduced, the violin would sound a lot like the viola. 

Because the relative harmonic balance is so important to an instrument's sound, 
the frequency response of a microphone, amplifier, speaker and all other ele- 
ments in the signal path can have an effect on the timbre (tonal balance) of a 
sound. If the frequency response isn't fíat, the timbre of the sound will be 
changed. For example, if the high frequencies are amplified less than the low 
and middle frequencies, then the sound will be duller than it should be. For 
this reason, a specific mic, mic placement or an equalizer can be used as tools 
to vary the timbre of an instrument, thereby changing its subjective sound. 

In addition to the variations in harmonic balance that can exist between instru- 
ments and their families, it is common for the harmonic balance to vary with 
respect to direction as sound waves radiate from an instrument. Figure 2.18 
shows the principal radiation patterns as they emanate from a cello (as seen 
from both the side and top views). 

Envelope 

Timbre isn't the only characteristic that lets us differentiate between instru- 
ments. Each one produces a sonic envelope that works in combination with 
timbre to determine its unique and subjective sound. The envelope of a wave¬ 
form can be described as characteristic variations in level that occur in time 


FIGURE 2.18 

Radiation patterns of 
a cello as viewed 
from the side (left) 
and top (right). 
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over the duration of a played note. The envelope 
of an acoustic or electronically generated signal 
is composed of four sections that vary in ampli- 
tude over time: 

■ Attack refers to the time taken for a sound 
to build up to its full volume when a note 
is initially sounded. 

■ Decay refers to how quickly the sound 
levels off to a sustain level after the initial 
attack peak. 

■ Sustain refers to the duration of the ongoing 
sound that's generated following the initial 
attack decay. 

■ Release relates to how quickly the sound 
will decay once the note is released. 

Figure 2.19a illustrates the envelope of a trom- 
bone note. The attack, decay times and internal 
dynamics produce a smooth, sustaining sound. 
A cymbal crash (Figure 2.19b) combines a high- 
level, fast attack with a longer sustain and decay 
that creates a smooth, lingering shimmer. Figure 
2.19c illustrates the envelope of a snare drum. 
Notice that the initial attack is much louder than 
the internal dynamics ... while the final decay 
trails off very quickly, resulting in a sharp, per- 
cussive sound. 


A D 


R 



(a) 



(b) 



It's important to note that the concept of an 
envelope relies on peak waveform valúes, while 
the human perception of loudness is propor- 
tional to the average wave intensity over a period 
of time (rms valué). Therefore, high-amplitude 
portions of the envelope won't make an instru- 
ment sound loud unless the amplitude is main- 
tained for a sustained period. Short high-amplitude sections tend to contribute 
to a sound's overall character, rather than to its loudness. By using a compressor 
or limiter, an instrumenté character can often be modified by changing the 
dynamics of its envelope without changing its timbre. 


(c) 


LOUDNESS LEVELS: THE DECIBEL 

The ear operates over an energy range of approximately 10 13 :1 
(10,000,000,000,000:1), which is an extremely wide range. Since it's difficult 
for us humans to conceptualize number ranges that are this large, a logarithmic 
scale has been adopted to compress the measurements into figures that are more 


FIGURE 2.19 

Various musical 
waveform envelopes: 

(a) trombone, 

(b) cymbal crash, 
and (c) snare drum, 
where A = attack, 

D = decay, 

S = sustain, and 
R = release. 






Loudness Levels: The Decibel 


FIGURE 2.20 

Linear and 
logarithmic curves: 

(a) linear; 

(b) logarithmic. 





input level 



input level 


manageable. The unit used for measuring sound-pressure level (SPL), signal 
level and relative changes in signal level is the decibel ( dB ), a term that literally 
means 1/1 Oth of a Bell ... a telephone transmission measurement unit that was 
named after Alexander Graham Bell, inventor of the telephone. In order to 
develop an understanding of the decibel, we first need to examine logarithms 
and the logarithmic scale (Figure 2.20). The logarithm ( log ) is a mathematical 
function that reduces large numeric valúes into smaller, more manageable 
numbers. Because logarithmic numbers increase exponentially in a way that's 
similar to how we perceive loudness (e.g., 1, 2, 4, 16, 128, 256, 65,536, ...), it 
expresses our perceived sense of volume more precisely than a linear curve can. 

Before we delve into a deeper study of this important concept and how it deais 
with our perceptual senses, let's take a moment to understand the basic concepts 
and building block ideas behind the log scale, so as to get a better understand¬ 
ing of what examples such as "+3 dB at 10,000 Hz" really mean. Be patient with 
yourself! Over time, the concept of the decibel will become as much a part of 
your working vocabulary as ounces, gallons and miles per hour. 

Logarithmic basics 

In audio, we use logarithmic valúes to express the differences in intensities 
between two levels (often, but not always, comparing a measured level to a 
standard reference level). Because the differences between these two levels can 
be really, really big, a simpler system would make use of expressed valúes that 
are mathematical exponents of 10. To begin, finding the log of a number such 
as 17,386 without a calculator is not only difficult... it's unnecessary! All that's 
really important to help you along are three simple guidelines: 

■ The log of the number 2 is 0.3. 

■ When a number is an integral power of 10 (e.g., 100, 1000, 10,000), the 
log can be found simply by adding up the number of zeros. 
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■ Numbers that are greater than 1 will have a positive log valué, while those 
less than 1 will have a negative log valué. 

The first one is an easy fact to remember: The log of 2 is 0.3 ... this will make 
sense shortly. The second one is even easier: The logs of numbers such as 100, 
1000 or 10,000,000,000,000 can be arrived at by simply counting up the zeros. 
The last guideline relates to the fact that if the measured valué is less than the 
reference valué, the resulting log valué will be negative. For example: 

log2 = 0.3 

log 1/2 = log0.5 = -0.3 
logl0,000,000,000,000 = 13 

loglOOO = 3 
log100 = 2 
log10 = 1 
logl = 0 
logO.l = -1 
logO.Ol = -2 
logO.001 = -3 


All other numbers can be arrived at by using a scientific calculator (most com- 
puters and many cell phones have one built in); however, it's unlikely that you 
will ever need to know any log valúes beyond understanding the basic concepts 
that are listed above. 

The decibel 

Now that we've gotten past the absolute bare basics, I'd like to break with tradi- 
tion again and attempt an explanation of the decibel in a way that's less complex 
and relates more to our day-to-day needs in the sound biz. First off, the decibel 
is a logarithmic valué that "expresses differences in intensities between two 
levels." From this, we can infer that these levels are expressed by several units 
of measure, the most common being sound-pressure level (SPL), voltage (V) 
and power (wattage, or W). Now, let's look at the basic math behind these three 
measurements. 

SOUND-PRESSURE LEVEL 

Sound-pressure level is the acoustic pressure that's built up within a defined 
atmospheric area (usually a square centimeter, or cm 2 ). Quite simply, the higher 
the SPL, the louder the sound (Figure 2.21). In this instance, our measured 
reference (SPL re f) is the threshold of hearing, which is defined as being the 
softest sound that an average person can hear. Most conversations will have an 
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SPL of about 70 dB, while average home stereos are played at volumes ranging 
between 80 and 90 dB SPL. Sounds that are so loud as to be painful have SPLs 
of about 130 to 140 dB (10,000,000,000,000 or more times louder than the 
0-dB reference). We can arrive at an SPL rating by using the formula: 

dB SPL = 20log SPL/SPL ref 


where SPL is the measured sound pressure (in dyne/cm 2 ). SPL ref is a reference 
sound pressure (the threshold limit of human hearing, 0.02 millipascals = 2 
ten-billionths of our atmosphere). 


FIGURE 2.21 

Chart of sound- 
pressure levels. 

(Courtesy of General 
Radio Company.) 

Typical A-Weighted Sound Levels dB re: 20uN/m 2 
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FIGURE 2.22 

Doubling the distance 
of a pickup will lower 
the perceived direct 
signal level by 6 dB 
SPL. 


From this, I feel that the major concept that 
needs to be understood is the idea that SPL levels 
change with the square of the distance (henee, 
the 20 log part of the equation). This means that 
whenever a source/pickup distance is doubled, 
the SPL level will be reduced by 6 dB (20 log 
0.5/1 = 20 x -0.3 = -6 dB SPL); as the distance 
is halved, it will increase by 6 dB (20 log 2/1 = 
20 x 0.3 = 6 dB SPL), as shown in Figure 2.22. 

VOLTAGE 

Voltage can be thought of as the pressure behind 
electrons within a wire. As with acoustic energy, 
comparing one voltage level to another level (or 
reference level) can be expressed as dBv using the 
equation: 

dBv = 20 log V/V re f 

where V is the measured voltage, and V re f is a 
reference voltage (0.775 volts). 

POWER 

Power is usually a measure of wattage or current 
and can be thought of as the flow of electrons 
through a wire over time. Power is generally 
associated with audio signáis that are carried 
throughout an audio production system. Unlike 
SPL and voltage, the equation for signal level 
(which is often expressed in dBm) is: 



2d (6dB lower In level) 
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dBm = lOlog P/P re f 

where P is the measured wattage, and P re f is referenced to 1 milliwatt (0.001 
watt). 

The simple heart of the matter 

I am going to stick my neck out and State that, when dealing with decibels, it's 
far more common for working professionals to deal with the concept of power. 
The dBm equation expresses the spirit of the decibel term when dealing with 
the markings on an audio device or the numeric valúes in a Computer dialog 
box. This is due to the fact that power is the unit of measure that's most often 
expressed when dealing with audio equipment Controls; therefore, it's my per¬ 
sonal opinión that the average working stifif only needs to grasp the following 
basic concepts: 

■ A 1-dB change is barely noticeable to most ears. 

■ Turning something up by 3 dB will double the signal's level (believe it or 
not, doubling the signal level won't increase the perceived loudness as 
much as you might think). 

■ Turning something down by 3 dB will halve the signal's level (likewise, 
halving the signal level won't decrease the perceived loudness as much as 
you might think). 

■ The log of an exponent of 10 can be easily figured by simply counting the 
zeros (e.g., the log of 1000 is 3). Given that this figure is multiplied by 10 
(10 log P/P re f), turning something up by 10 dB will increase the signal's 
level 10-fold, 20 dB will yield a 100-fold increase, 30 dB will yield a 1000- 
fold increase, etc. 

Most pros know that turning a level fader up by 3 dB will effectively double its 
energy output (and vice versa). Beyond this, it's unlikely that anyone will ever 
ask, "Would you please turn that up a thousand times?" It just won't happen! 
However, when a pro asks his or her assistant to turn the gain up by 20 dB, 
that assistant will often instinctively know what 20 dB is ... and what it sounds 
like. I guess I'm saying that the math really isn't nearly as important as the 
ongoing process of getting an instinctive feel for the decibel and how it relates 
to relative levels within audio production. 


THE EAR 

A sound source produces acoustic waves by alternately compressing and rarefy- 
ing the air molecules between it and the listener, causing fluctuations that fall 
above and below normal atmospheric pressure. The human ear is a sensitive 
transducer that responds to these pressure variations by way of a series of related 
processes that occur within the auditory organs ... our ears. When these varia¬ 
tions arrive at the listener, sound-pressure waves are collected in the aural canal 
by way of the outer ear's pinna. These are then directed to the eardrum, a 



The Ear 


FIGURE 2.23 

Outer, middle, and 
inner ear. 



stretched drum-like membrane (Figure 2.23), where the sound waves are 
changed into mechanical vibrations, which are transferred to the inner ear by 
way of three bones known as the hammer, anvil and stirrup. These bones act 
both as an amplifier (by significantly increasing the vibrations that are transmit- 
ted from the eardrum) and as a limiting protection device (by reducing the level 
of loud, transient sounds such as thunder or fireworks explosions). The vibra¬ 
tions are then applied to the inner ear (cochlea)—a tubular, snail-like organ 
that contains two fluid-filled chambers. Within these chambers are tiny hair 
receptors that are lined up in a row along the length of the cochlea. These hairs 
respond to certain frequencies depending on their placement along the organ, 
which results in the neural stimulation that gives us the sensation of hearing. 
Permanent hearing loss generally occurs when these hair/nerve combinations 
are damaged or as they deteriórate with age. 

Threshold of hearing 

In the case of SPL, a convenient pressure-level reference is the threshold of 
hearing , which is the minimum sound pressure that produces the phenomenon 
of hearing in most people and is equal to 0.0002 microbar. One microbar is 
equal to 1 millionth of normal atmospheric pressure, so it's apparent that the 
ear is an amazingly sensitive instrument. In fact, if the ear were any more sensi- 
tive, the thermal motion of molecules in the air would be audible! When ref- 
erencing SPLs to 0.0002 microbar, this threshold level usually is denoted as 
0 dB SPL, which is defined as the level at which an average person can hear a 
specific frequency only 50% of the time. 
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Threshold of feeling 

An SPL that causes discomfort in a listener 50% of the time is called the thresh¬ 
old of feeling. It occurs at a level of about 118 dB SPL between the frequencies 
of 200 Hz and 10 kHz. 

Threshold of pain 

The SPL that causes pain in a listener 50% of the time is called the threshold of 
pain and corresponds to an SPL of 140 dB in the frequency range between 
200 Hz and 10 kHz. 

Taking care of your hearing 

During the 1970s and early 1980s, recording studio monitoring levels were 
often turned so high as to be truly painful. In the mid-1990s, a small band of 
powerful producers and record executives banded together to successfully 
reduce these average volumes down to tolerable levels (85 to 95 dB) ... a prac- 
tice that continúes to this day. Live sound venues and acts often continué the 
practice of raising house and stage volumes to chest-thumping levels. Although 
these levels are exciting, long-term exposure can lead to temporary or perma- 
nent hearing loss. So what types of hearing loss are there? 

■ Acoustic trauma: This happens when the ear is exposed to a sudden, loud 
noise in excess of 140 dB. Such a shock could lead to permanent hearing 
loss. 

■ Temporary threshold shift: The ear can experience temporary hearing loss 
when exposed to long-term, loud noise. 

■ Permanent threshold shift: Extended exposure to loud noises in a specific or 
broad hearing range can lead to permanent hearing loss in that range. In 
short, the ear becomes less sensitive to sounds in the damaged frequency 
range leading to a reduction in perceived volume. ... What? 

Here are a few hearing conservation tips (courtesy of the House Ear Institute, 
www.hei.org) that can help reduce hearing loss due to long-term exposure of 
sounds over 115 dB: 

■ Avoid hazardous sound environments; if they are not avoidable, wear 
hearing protection devices, such as foam earplugs, custom-molded ear- 
plugs, or in-ear monitors. 

■ Monitor sound-pressure levels at or around 85 dB. The general rule to 
follow is if you're in an environment where you must raise your voice to 
be heard, then you're monitoring too loudly and should limit your expo¬ 
sure times. 

■ Take 15-minute "quiet breaks" every few hours if you're being exposed to 
levels above 85 dB. 

■ Musicians and other live entertainment professionals should avoid prac- 
ticing at concert-hall levels whenever possible. 

■ Have your hearing checked by a licensed audiologist. 


Psychoacoustics 


PSYCHOACOUSTICS 

The area of psychoacoustics deais with how and why the brain interprets a par¬ 
ticular sound stimulus in a certain way. Although a great deal of study has been 
devoted to this subject, the primary device in psychoacoustics is the all-elusive 
brain ... which is still largely unknown to present-day Science. 

AUDITORY PERCEPTION 

From the outset, it's important to realize that the ear is a nonlinear device 
(what's received at your ears isn't always what you'll hear). It's also important 
to note that the ear's frequency response (its perception of timbre) changes with 
the loudness of the perceived signal. The loudness compensation switch found 
on many hi-fi preamplifiers is an attempt to compénsate for this decrease in the 
ear's sensitivity to low- and high-frequency sounds at low listening levels. 

The Fletcher-Munson equal-loudness contour curves (Figure 2.24) indicate the ear's 
average sensitivity to different frequencies at various levels. These indicate the 
sound-pressure levels that are required for our ears to hear frequencies along 
the curve as being equal in level to a 1000-Hz reference level (measured in 
phons). Thus, to equal the loudness of a 1-kHz tone at 110 dB SPL (a level 
typically created by a trumpet-type car horn at a distance of 3 feet), a 40-Hz 


FIGURE 2.24 

The Fletcher-Munson 
curve shows an 
equal loudness 
contour for puré 
tones as perceived 
by humans having an 
average hearing 
acuity. These 
perceived loudness 
levels are charted 
relative to sound- 
pressure levels at 
1000 Hz. 
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tone has to be about 6 dB louder, whereas a 10-kHz tone must be 4 dB louder 
in order to be perceived as being equally loud. At 50 dB SPL (the noise level 
present in the average prívate business office), the level of a 40-Hz tone must 
be 30 dB louder and a 10-kHz tone 13 dB louder than a 1-kHz tone to be per¬ 
ceived as having the same volume. Thus, if a piece of music is mixed to sound 
great at a level of 85 to 95 dB, its bass and treble balance will actually be boosted 
when turned up (often a good thing). If the same piece were mixed at 110 dB 
SPL, it would sound both bass and treble shy when played at lower levels ... 
because no compensation for the ear's response was added to the mix. Over the 
years, it has generally been found that changes in apparent frequency balance 
are less apparent when monitoring at levels of 85 dB SPL. 

In addition to the above, whenever it is subjected to sound waves that are above 
a certain loudness level, the ear can produce harmonic distortion that doesn't 
exist in the original signal. For example, the ear can cause a loud 1-kHz sine 
wave to be perceived as being a combination of 1-, 2-, 3-kHz waves, and so on. 
Although the ear might hear the overtone structure of a violin (if the listening 
level is loud enough), it might also perceive additional harmonics (thus chang- 
ing the timbre of the instrument). This is one of several factors that implies that 
sound monitored at very loud levels could sound quite different when played 
back at lower levels. 

The loudness of a tone can also affect our ear's perception of pitch. For example, 
if the intensity of a 100-Hz tone is increased from 40 to 100 dB SPL, the ear 
will hear a pitch decrease of about 10%. At 500 Hz, the pitch will change about 
2% for the same increase in sound-pressure level. This is one reason why musi- 
cians find it difficult to tune their instruments when listening through loud 
headphones. 

As a result of the nonlinearities in the ear's response, tones will often interact 
with each other rather than being perceived as being sepárate. Three types of 
interaction effects can occur: 

■ Beats 

■ Combination tones 

■ Masking. 


Beats 

Two tones that differ only slightly in frequency and have approximately 
the same amplitude will produce an effect known as beats. This effect sounds 
like repetitive volume surges that are equal in frequency to the difference 
between these two tones. The phenomenon is often used as an aid for tuning 
instruments, because the beats slow down as the two notes approach the same 
pitch and finally stop when the pitches match. In reality, beats are a result 
of the ear's inability to sepárate closely pitched notes. This results in a third 
frequency that's created from the phase sum and difference valúes between the 
two notes. 


Auditory Perception 


d i y y 

do it yourself 

Tutorial: Beats 


Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 2—Beats Tutorial,” and download all of the 
soundfiles. 

Load the 440-Hz file onto track 1 of the digital audio 
workstation (DAW) of your choice, making sure to 
place the file at the beginning of the track, with the 
signal panned center. 

Load the 445- and 450-Hz files into the next two 
consecutive tracks. 

Solo and play the 440-Hz tone. 


5. Solo both the 440- and 445-Hz tones and listen to 
their combined results. Can you hear the 5-Hz beat 
tone? (445 Hz - 440 Hz = 5 Hz) 

6. Solo both the 445- and 450-Hz tones and listen to 
their combined results. Can you hear the 5-Hz beat 
tone? (450 Hz - 445 Hz = 5 Hz) 

7. Now, solo both the 440- and 450-Hz tones and listen 
to their combined results. Can you hear the 10-Hz 
beat tone? (450 Hz - 440 Hz = 10 Hz) 


Combination tones 

Combination tones result when two loud tones differ by more than 50 Hz. In this 
case, the ear perceives an additional set of tones that are equal to both the sum 
and the difference between the two original tones ... as well as being equal to 
the sum and difference between their harmonics. The simple formulas for com- 
puting the fundamental tones are: 

sum tone = f\ + f 2 
difference tone = /, - f 2 

Difference tones can be easily heard when they are below the frequency of both 
tones' fundamentáis. For example, the combination of 2000 and 2500 Hz pro¬ 
duces a difference tone of 500 Hz. 

Masking 

Masking is the phenomenon by which loud signáis prevent the ear from hearing 
softer sounds. The greatest masking effect occurs when the frequency of the 
sound and the frequency of the masking noise are cióse to each other. For 
example, a 4-kHz tone will mask a softer 3.5-kHz tone but has little effect on 
the audibility of a quiet 1000-Hz tone. Masking can also be caused by harmon¬ 
ics of the masking tone (e.g., a 1-kHz tone with a strong 2-kHz harmonic might 
mask a 1900-Hz tone). This phenomenon is one of the main reasons why stereo 
placement and equalization are so important to the mixdown process. An 
instrument that sounds fine by itself can be completely hidden or changed in 
character by louder instruments that have a similar timbre. Equalization, mic 
choice or mic placement might have to be altered to make the instruments 
sound different enough to overeóme any masking effect. 
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do it yourself 

Tutorial: Masking 


1. Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 2—Masking Tutorial” and download all of the 
soundfiles. 

2. Load the 1000-Hz file onto track 1 of the digital audio 
workstation (DAW) of your choice, making sure to 
place the file at the beginning of the track, with the 
signal panned center. 

3. Load the 3800- and 4000-Hz files into the next two 
consecutive tracks. 


Solo and play the 1000-Hz tone. 

Solo both the 1000- and the 4000-Hz tones and listen 
to their combined results. Can you hear both of the 
tones clearly? 

6. Solo and play the 3800-Hz tone. 

7. Solo both the 3800- and the 4000-Hz tones and listen 
to their combined results. Can you hear both of the 
tones clearly? 


4. 

5. 


PERCEPTION OF DIRECTION 

Although one ear can't discern the direction of a sound's origin, two ears can. 
This capability of two ears to localize a sound source within an acoustic space 
is called spatial or binaural localization. This effect is the result of three acoustic 
cues that are received by the ears: 

■ Interaural intensity differences 

■ Interaural arrival-time differences 

■ The effects of the pinnae (outer ears). 

Middle to higher frequency sounds originating from the right side will reach 
the right ear at a higher intensity level than the left ear, causing an interaural 
intensity difference. This volume difference occurs because the head casts an 
acoustic block or shadow, allowing only reflected sounds from surrounding 
surfaces to reach the opposite ear (Figure 2.25). Because the reflected sound 
travels farther and loses energy at each reflection—in our example the intensity 
of sound perceived by the left ear will be greatly reduced, resulting in a signal 
that's perceived as originating from the right. 

This effect is relatively insignificant at lower frequencies, where wavelengths are 
large compared to the head's diameter, allowing the wave to easily bend around 
its acoustic shadow. For this reason, a different method of localization (known 
as interaural arrival-time differences ) is employed at lower frequencies (Figure 
2.26). In both Figures 2.25 and 2.26, small time differences occur because the 
acoustic path length to the left ear is slightly longer than the path to the right 
ear. The sound pressure therefore arrives at the left ear at a later time than the 
right. This method of localization (in combination with interaural intensity 
differences) helps to give us lateral localization cues over the entire frequency 
spectrum. 


Perception of Direction 



acoustic 

shadow 

FIGURE 2.25 

The head casis an 
acoustic shadow that 
helps with localiza- 
tion at middle to 
upper frequencies. 



FIGURE 2.26 

Interaural arrival-time 
differences occurring 
at lower frequencies. 



FIGURE 2.27 

The pinna and its 
reflective ridges for 
determining vertical 
location information. 


Intensity and delay cues allow us to perceive the direc¬ 
tion of a sound's origin but not whether the sound 
originates from the front, behind or below. The pinna 
(Figure 2.27), however, makes use of two ridges that 
reflect sound into the ear. These ridges introduce minute 
time delays between the direct sound (which reaches the 
entrance of the ear canal) and the sound that's reflected 
from the ridges (which varíes according to source loca¬ 
tion). It's interesting to note that beyond 130° from the 
front of our face, the pinna is able to reflect and delay 
sounds by 0 and 80 microseconds (¡asee), making rear 
localization possible. Ridge 2 (see Figure 2.27) has been 
reported to produce delays of between 100 and 330 ¡asee 
that help us to lócate sources in the vertical plañe. The 
delayed reflections from both ridges are then combined 
with the direct sound to produce frequeney-response 
colorations that are compared within the brain to deter¬ 
mine source location. Small movements of the head can 
also provide additional position information. 

If there are no differences between what the left and 
right ears hear, the brain assumes that the source is the 
same distance from each ear. This phenomenon allows 
us to position sound not only in the left and right loud- 
speakers but also monophonically between them. If the 
same signal is fed to both loudspeakers, the brain per- 
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ceives the sound identically in both ears and deduces that the source must be 
originating from directly in the center. By changing the proportion that's sent 
to each speaker, the engineer changes the relative interaural intensity differences 
and thus creates the illusion of physical positioning between the speakers. This 
placement technique is known as panning (Figure 2.28). 


PERCEPTION OF SPACE 

In addition to perceiving the direction of sound, 
the ear and brain combine to help us perceive 
the size and physical characteristics of the acous- 
tic space in which a sound occurs. When a sound 
is generated, a percentage reaches the listener 
directly, without encountering any obstacles. A 
larger portion, however, is propagated to the 
many surfaces of an acoustic enclosure. If these 
surfaces are reflective, the sound is bounced back 
into the room and toward the listener. If the 
surfaces are absorptive, less energy will be 
reflected back to the listener. Three types of 
reflections are commonly generated within an 
enclosed space (Figure 2.29): 





center 



FIGURE 2.28 

Pan pot settings and 
their relative spatial 
positions. 



mid right 






FIGURE 2.29 

The three soundfield 
types that are 
generated within an 
enclosed space. 
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Perception of Space 


■ Direct sound 

■ Early reflections 

■ Reverberation. 

Direct sound 

In air, sound travels at a constant speed of about 1130 feet per second, so a wave 
that travels from the source to the listener will follow the shortest path and 
arrive at the listener's ear first. This is called the direct sound. Direct sounds 
determine our perception of a sound source's location and size and conveys the 
true timbre of the source. 

Early reflections 

Waves that bounce off of surrounding surfaces in a room must travel further 
than direct sound to reach the listener and therefore arrive after the direct sound 
and from a multitude of directions. These waves form what are called early 
reflections. Early reflections give us clues as to the reflectivity, size and general 
nature of an acoustic space. These sounds generally arrive at the ears less than 
50 msec after the brain perceives the direct sound and are the result of reflec¬ 
tions off of the largest, most prominent boundaries within a room. The time 
elapsed between hearing the direct sound and the beginning of the early reflec¬ 
tions helps to provide information about the size of the performance room. 
Basically, the farther the boundaries are from the source and listener, the longer 
the delay before it's reflected back to the listener. 

Another aspect that occurs with early reflections is called temporal fusión. Early 
reflections arriving at the listener within 30 msec of the direct sound are not 
only audibly suppressed, but are also fused with the direct sound. In effect, the 
ear can't distinguish the closely occurring reflections and considers them to be 
part of the direct sound. The 30-msec time limit for temporal fusión isn't abso- 
lute; rather, it depends on the sound's envelope. Fusión breaks down at 4 msec 
for transient clicks, whereas it can extend beyond 80 msec for slowly evolving 
sounds (such as a sustained organ note or legato violin passage). Despite the 
fact that the early reflections are suppressed and fused with the direct sound, 
they still modify our perception of the sound, making it both louder and fuller. 

Reverberation 

Whenever room reflections continué to bounce off of room boundaries, a ran- 
domly decaying set of sounds can often be heard after the source stops in the 
form of reverberation. A highly reflective surface absorbs less of the wave energy 
at each reflection and allows the sound to persist longer after the initial sound 
stops (and vice versa). Sounds reaching the listener 50-msec later in time are 
perceived as a random and continuous stream of reflections that arrive from all 
directions. These densely spaced reflections gradually decrease in amplitude and 
add a sense of warmth and body to a sound. Because it has undergone múltiple 
reflections, the timbre of the reverberation is often quite different from the 
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direct sound (with the most notable difference being a roll-off of high frequen- 
cies and a slight bass emphasis). 

The time it takes for a reverberant sound to decrease to 60 dB below its original 
level is called its decay time or reverb time and is determined by the room's 
absorption characteristics. The brain is able to perceive the reverb time and 
timbre of the reverberation and uses this information to form an opinión on 
the hardness or softness of the surrounding surfaces. The loudness of the per- 
ceived direct sound increases rapidly as the listener moves closer to the source, 
while the reverberation levels will often remain the same, because the diffusion 
is roughly constant throughout the room. This ratio of the direct sound's loud¬ 
ness to the reflected sound's level helps listeners judge their distance from the 
sound source. 

Whenever artificial reverb and delay units are used, the engineer can generate 
the necessary cues to convince the brain that a sound was recorded in a huge, 
stone-walled cathedral—when, in fact, it was recorded in a small, absorptive 
room. To do this, the engineer programs the device to mix the original unrever- 
berated signal with the necessary early delays and random reflections. Adjusting 
the number and amount of delays on an effects processor gives the engineer 
control over all of the necessary parameters to determine the perceived room 
size, while decay time and frequency balance can help to determine the room's 
perceived surfaces. By changing the proportional mix of direct-to-processed 
sound, the engineer/producer can place the sound source at either the front or 
rear of the artificially created space. 

DOUBLING 

By repeating a signal using a short delay of 4 to 20 msec (or so), the brain can 
be fooled into thinking that the apparent number of instruments being played 
is doubled. This process is called doubling. Often, acoustic doubling and tripling 
can be physically re-created during the overdub phase by recording a track and 
then going back and laying down one or more passes while the musicians listen 
to the original track. When this isn't possible, delay devices can be cost effec- 
tively and easily used to simúlate this effect. If a longer delay is chosen (more 
than about 35 msec), the repeat will be heard as discrete echoes, causing the 
delay (or series of repeated delays) to create a slap echo or slap back. This and 
other effects can be used to double or thicken up a sound ... anybody want 
vocals that sound like a 1950s pop star? 
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CHAPTER 3 


Studio Acoustics 
and Design 


The Audio Cyclopedia defines the term acoustics as "a Science dealing with the 
production, effects and transmission of sound waves; the transmission of sound 
waves through various médiums, including reflection, refraction, diffraction, 
absorption and interference; the characteristics of auditoriums, theaters and 
studios, as well as their design." We can see from this description that the proper 
acoustic design of music recording, project and audio-for-visual or broadcast 
studios is no simple matter. A wide range of complex variables and interrela- 
tionships often come into play in the creation of a successful acoustic and 
monitoring design. When designing or redesigning an acoustic space, the fol- 
lowing basic requirements should be considered: 

■ Acoustic isolation: This prevenís external noises from transmitting into the 
studio environment through the air, ground or building structure. It can 
also prevent feuds that can arise when excessive volume levels leak out 
into the surrounding neighborhood. 

■ Frequency balance: The frequency components of a room shouldn't 
adversely affect the acoustic balance of instruments and/or speakers. 
Simply stated, the acoustic environment shouldn't alter the sound quality 
of the original or recorded performance. 

■ Acoustic separation: The acoustic environment should not interfere with 
intelligibility and should offer the highest possible degree of acoustic 
separation within the room (often a requirement for ensuring that sounds 
from one instrument aren't unduly picked up by another instrument's 
microphone). 

■ Reverberation: The control of sonic reflections within a space is an impor- 
tant factor for maximizing the intelligibility of music and speech. No 
matter how short the early reflections and reverb times are, they will 
add an important psychoacoustic sense of "space" in the sense that 
they can give our brain subconscious cues as to a room's size, number 
of reflective boundaries, distance between the source and listener, and 
so forth. 

■ Cost factors: Not the least of all design and construction factors is 
cost. Multimillion-dollar facilities often employ studio designers and 
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construction teams to create a plush decor that has been acoustically tuned 
to fit the needs of both the owners and their clients. Owners of project 
studios and budget-minded production facilities, however, can also take 
full advantage of the same basic acoustic principies and construction 
techniques and apply them in cost-effective ways. 

This chapter will discuss many of the basic acoustic principies and construction 
techniques that should be considered in the design of a music or sound produc¬ 
tion facility. I'd like to emphasize that any or all of these acoustical topics can 
be applied to any type of audio production facility and aren't only limited to 
professional music studio designs. For example, owners of modest project and 
bedroom studios should know the importance of designing a control room 
that's symmetrical. It doesn't cost anything to know that if one speaker is in a 
córner and the other is on a wall, the perceived center image will be off balance. 
As with many techno-artistic endeavors, studio acoustics and design are a 
mixture of fundamental physics (in this case, mostly dimensional mathematics) 
and an equally large dose of common sense and dumb luck. More often than 
not, acoustics is an artistic Science that melds physics with the art of intuition 
and experience. 


STUDIO TYPES 

Although the acoustical fundamentáis are the same for most studio design 
types, differences will often follow the form, function and budgets of the 
required tasks at hand. Some of the more common studio types inelude: 

■ Professional music studios 

■ Audio-for-visual production environments 

■ Project studios 

■ Portable studios. 

The professional recording studio 

The professional recording studio (Figures 3.1 and 3.2) is first and foremost a com- 
mercial business, so its design, decor, and acoustical construction requirements 
are often much more demanding than those of a privately owned project studio. 
In some cases, an acoustical designer and experienced construction team are 
placed in charge of the overall building phase of a professional facility. In others, 
the studio's budget preeludes the hiring of such professionals, which places the 
studio owners and staff squarely in charge of designing and constructing the 
entire facility. Whether you happen to have the luxury of building a new facility 
from the ground up or are renovating a studio within an existing shell, you 
would probably benefit from a professional studio designer's experience and 
skills. Such expert advice often proves to be cost effective in the long run, 
because errors in design judgment can lead to cost overruns, lost business due 
to unexpected delays or the unfortunate State of living with mistakes that could 
have been easily avoided. 


Studio Acoustics and Design ;]APTfl O 




FIGURE 3.1 

BiCoastal Music, 
Ossining, NY: 

(a) control room; 

(b) recording studio. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 


The audio-for-visual production environment 

An audio-for-visual production facility is used for video, film and game post- 
production (often simply called "post") and ineludes such facets as music 
recording for film or other media (scoring), score mixdown, automatic dialog 
replacement (ADR, which is the replacement of on- and off-screen dialog to 
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FIGURE 3.2 

Oven Studios, Alicia 
Keys, Long Island, 
NY: (a) control room; 
(b) recording studio. 
(Courtesy of 
Walters-Storyk 
Design Group, www. 
wsdg.com.) 



visual media) and Foley (the replacement and creation of on- and off-screen 
sound effects). As with music studios, audio-for-visual production facilities can 
range from high-end facilities that can accommodate the posting needs of 
network video or feature film productions (Figure 3.3) to a simple, budget- 
minded project studio that's equipped with video and a digital audio work- 
station. As with the music studio, audio-for-visual construction and design 
techniques often span a wide range of styles and scope in order to fit the budget 
needs at hand. 


The project studio 

It goes without saying that the vast majority of audio production studios 
fall into the project studio category. This basic definition of such a facility is 
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FIGURE 3.3 

Skywalker Sound 
scoring stage control 
room, Marín County, 
CA. (Courtesy of 
Skywalker Sound, 
www.skysound.com.) 



FIGURE 3.4 

Gettin’ it all going in 
the bedroom studio. 
(Courtesy of Yamaha 
Corporation of 
America, www. 
yamaha.com.) 


open to interpretation. It's usually intended as a personal production resource 
for recording music, audio-for-visual production, multimedia production, 
voiceovers ... you ñame it. Project studios can range from being fully commer- 
cial in nature to smaller setups that are both personal and private (Figure 3.4). 
All of these possible studio types have been designed with the idea of giving 
artists the flexibility of making their art in a personal, off-the-clock environment 
that's both cost and time effective. The design and construction considerations 
for creating a privately owned project studio often differ from the design con¬ 
siderations for a professional music facility in two fundamental ways: 

■ Building constraints 

■ Cost. 
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FIGURE 3.5 

WorkirT at light 
speed in the project 
studio. 



Generally, a project studio's room (or series of rooms) is built into an artist's 
home or a rented space where the construction and dimensional details are 
already defined. This fact (combined with inherent cost considerations) often 
leads the owner/artist to employ cost-effective techniques for sonically treating 
a room. Even if the room has little or no treatment, keep in mind that a basic 
knowledge of acoustical physics and room design can be a handy and cost- 
effective tool as your experience, production needs and business abilities grow. 

Modern-day digital audio workstations (DAWs) have squarely placed the Mac 
and PC within the ergonomics and functionality of the project studio (Figure 
3.5). In fact, in many cases, the DAW "is" the project studio. With the advent 
of self-powered speaker monitors, cost-effective microphones and hardware 
DAW controllers, it's a relatively simple matter to design a powerful production 
system into almost any existing space. 


With regard to setting up any production/monitoring 
environment, l’d like to draw your attention to the need 
for symmetry in any critical monitoring environment. A 
symmetrical acoustic environment around the central 
mixing axis can work wonders toward creating a 
balanced left/right and surround image. Fortunately, this 
often isn’t a difficult goal to achieve. An acoustical and 
speaker placement environment that isn’t balanced 


between the left-hand and right-hand sides will allow for 
differing reflections, absorption coefficients and variations 
in frequency response that can adversely affect the 
imaging and balance of your final mix. Further information 
on this important subject can be found later in this 
chapter ... consider this your first heads-up on this 
important topic. 


The portable studio 

In this digital age, it goes without saying that the Mac, PC and other hardware 
options are available in portable configurations that can be easily taken on the 
road (Figure 3.6). With their increased power, portability and inherent cool 
factor, these small, lightweight powerhouses can often dish out serious music 
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FIGURE 3.6 

Recording out in the 
field. 


production power with the big boys. Of course, the real power and flexibility 
of a portable recording system rests in the freedom that it offers to escape the 
confines of the studio. For example, would you like to: 

■ Record a guitar overdub in the local high school gym? 

■ Record your band's latest performance with a handheld recorder? 

■ Sample a car door slam at a Mexican Market? 

The sky's literally the limit! 

Portable recording systems have decreased in size and cost while increasing in 
power to the point where they've literally become a powerful studio on the go. 
Again, with the advent of self-powered speaker monitors and cost-effective 
microphones and hardware DAW interface/controller devices, these small-fry 
systems offer up tremendous amounts of production power, while being light 
on the pocketbook. Further discussions on portable and desktop digital systems 
can be found in Chapter 7. 

PRIMARY FACTORS GOVERNING STUDIO 
AND CONTROL ROOM ACOUSTICS 

Regardless of which type of studio facility is being designed, built and used, a 
number of primary concerns should be addressed in order to achieve the best 
possible acoustic results. In this section, we'll take a cióse look at such important 
and relevant aspects of acoustics as: 

■ Acoustic isolation 

■ Symmetry in control room and monitoring design 

■ Frequency balance 
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■ Absorption 

■ Reflection 

■ Reverberation. 

Although several mathematical formulas have been included in the following 
sections, it's by no means necessary that you memorize or worry about them. 
By far, I feel that it's more important that you grasp the basic principies of 
acoustics rather than fret over the underlying math. Remember: More often than 
not, acoustics is an artistic Science that blends math with the art of intuition 
and experience. 

Acoustic isolation 

Because most commercial and project studio environments make use of 
an acoustic space to record sound, it's often wise and necessary to employ effec- 
tive isolation techniques into their design in order to keep external noises 
to a minimum. Whether that noise is transmitted through the médium of air 
(e.g., from nearby auto, train, or jet traffic) or through solids (e.g., from air- 
conditioner rumbling, underground subways, or nearby businesses), special 
construction techniques will often be required to dampen these extraneous 
sounds (Figure 3.7). 

If you happen to have the luxury of building a studio facility from the ground 
up, a great deal of thought should be put into selecting the studio's location. If 
a location has considerable neighborhood noise, you might have to resort to 
extensive (and expensive) construction techniques that can "float" the rooms 
(a process that effectively isolates and uncouples the inner rooms from the 
building's outer foundations). If there's absolutely no choice of studio location 
and the studio happens to be located next to a factory, just under the airport's 


FIGURE 3.7 

Various isolation, 
absorption, and 
reflective acoustical 
treatments for the 
construction of a 
recording/monitoring 
environment. 
(Courtesy of Auralex 
Acoustics, www. 
auralex.com.) 
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main landing path or over the subway's 
uptown line ... you'll simply have to give in 
to destiny and build acoustical barriers to 
these outside interferences. 

The reduction in the sound-pressure level 
(SPL) of a sound source as it passes through 
an acoustic barrier of a certain physical mass 
(Figure 3.8) is termed the transmission loss 
(TL) of a signal. This attenuation can be 
expressed (in dB) as: 



TL = 14.5log At + 23 

where TL is the transmission loss in decibels, 


transmission loss 

\ 


and ísA is the surface density (or combined surface densities) of a barrier in 
pounds per square foot (lb/ft 2 ). 

Because transmission loss is frequency dependent, the following equation can 
be used to calcúlate transmission loss at various frequencies with some degree 
of accuracy: 


FIGURE 3.8 

Transmission loss 
refers to the 
reduction of a sound 
signal (in dB) as it 
passes through an 
acoustic barrier. 


TL = 14.5logA4/ - 16 
where/is the frequency (in hertz). 

Both common sense and the preceding two equations tell us that heavier acous¬ 
tic barriers will yield a higher transmission loss. For example, Table 3.1 tells 
us that a 12-inch-thick wall of dense concrete (yielding a surface density of 
150 lb/ft 2 ) offers a much greater resistance to the transmission of sound than 
can a 4-inch cavity filled with sand (which yields a surface density of 32.3 lb/ft 2 ). 
From the second equation (TL = 14.5 log Mf - 16), we can also draw the con¬ 
clusión that, for a given acoustic barrier, transmission losses will increase as the 
frequency rises. This can be easily illustrated by closing the door of a car that 
has its sound system turned up, or by shutting a single door to a music studio's 
control room. In both instances, the high frequencies will be greatly reduced in 
level, while the bass frequencies will be impeded to a much lesser extent. From 
this, the goal would seem to be to build a studio wall, floor, ceiling, window 
or door out of the thickest and most dense material that's available; however, 
expense and physical space often play roles in determining just how much of a 
barrier can be built to achieve the desired isolation. As such, a balance must 
usually be struck when using both space- and cost-effective building materials. 

WALL5 

When building a studio wall or reinforcing an existing structure, the primary 
goal is to reduce leakage (increase the transmission loss) through a wall as much 
as possible over the audible frequency range. This is generally done by: 



Primary Factors Governing Studio and Control Room Acoustics 


Table 3.1 Surface densities of common building materials. 

Material 

Thickness (inches) 

Surface Density (lb/ft 2 ) 

Brick 

4 

40.0 


8 

80.0 

Concrete (lightweight) 

4 

33.0 


12 

100.0 

Concrete (dense) 

4 

50.0 


12 

150.0 

Glass 

— 

3.8 


— 

7.5 


— 

11.3 

Gypsum wallboard 

— 

2.1 


— 

2.6 

Lead 

1/16 

3.6 

Particleboard 

— 

1.7 

Plywood 

— 

2.3 

Sand 

1 

8.1 


4 

32.3 

Steel 

— 

10.0 

Wood 

1 

2.4 


■ Building a wall structure that is as massive as is practically possible (both 
in terms of cubic and square foot density) 

■ Eliminating open joints that can easily transmit sound through the barrier 

■ Dampening structures, so that they are well supported by reinforcement 
structures and are free of resonances. 

The following guidelines can be helpful in the construction of framed walls that 
have high transmission losses: 

■ If at all possible, the inner and outer wallboards should not be directly 
attached to the same wall studs. The best way to avoid this is to alternately 
stagger the studs along the floor and ceiling frame, so that the front/back 
facing walls aren't in physical contact with each other (Figure 3.9a). 

■ Each wall facing should have a different density to reduce the likelihood 
of increased transmission due to resonant frequencies that might be sym- 
pathetic to both sides. For example, one wall might be constructed of two 
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FIGURE 3.9 

Double, staggered 
stud construction 
greatly reduces 
leakage by decou- 
pling the two wall 
surfaces from each 
other: (a) top view 
showing offset studs; 
(b) side view showing 
the application of 
caulk at all joint 
points to reduce 
leakage. 


5/8-inch gypsum wallboards, while the other wall might be composed of 
soft fiberboard that's surfaced with two 1/2-inch gypsum wallboards. 

■ If you're going to attach gypsum wallboards to a single wall face, you can 
increase transmission loss by mounting the additional layers (not the first 
layer) with adhesive caulking rather than using screws or nails. 

■ Spacing the studs 24 inches on center instead of using the traditional 
16-inch spacing yields a slight increase in transmission loss. 

■ To reduce leakage that might make it through the cracks, apply a bead of 
nonhardening caulk sealant to the inner gypsum wallboard layer at the 
wall-to-floor, wall-to-ceiling and córner junctions (Figure 3.9b). 

Generally, the same amount of isolation is required between the studio and the 
control room as is required between the studio's interior and exterior environ- 
ments. The proper building of this wall is important, so that an accurate tonal 
balance can be heard over the control-room monitors without promoting 
leakage between the rooms or producing resonances within the wall that would 
audibly color the signal. Optionally, a specially designed cavity, called a soffit, 
can be designed into the front-facing wall of the control room to house the 
larger studio monitors. This superstructure allows the main, farfield studio 
monitors to be mounted directly into the wall to reduce reflections and reso¬ 
nances in the monitoring environment. 

It's important for the soffit to be constructed to high standards, using a multiple- 
wall or high-mass design that maximizes the density with acoustically tight 
construction techniques in order to reduce leakage between the two rooms. 
Cutting corners by using substandard (and even standard) construction tech¬ 
niques in the building of a studio soffit can lead to unfortunate side effects, 
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such as wall resonances, rattles, and increased leakage. Typical wall construction 
materials inelude: 

■ Concrete: This is the best and most solid material, but it is often expensive 
and it's not always possible to pour cement into an existing design. 

■ Bricks (hollow-form or solid-facing): This excellent material is often easier to 
place into an existing room than concrete. 

■ Gypsum plasterboard: Building múltiple layers of plasterboard onto a dou- 
ble-walled stud frame is often the most cost- and design-efficient approach 
to reducing resonances and maximizing transmission loss. It's often a 
good idea to reduce these resonances by filling the wall cavities with rock- 
wool or fiberglass, while bracing the internal structure to add an extra 
degree of stiffness. 

Studio monitors can be designed into the soffit in a number of ways. In one 
expensive approach, the speakers' inner enclosures are cavities designed into 
walls that are made from a single concrete pour. Under these conditions, reso¬ 
nances are completely eliminated. Another less expensive approach has the 
studio monitors resting on poured concrete pedestals; in this situation, inserís 
can be cast into the pedestals that can accept threaded rebar rods (known as 
all-thread). By filing the rods to a chamfer or a sharper point, it's possible to 
adjust the position, slant and height of the monitors for final positioning into 
the soffit's wall framing. The most common and affordable approach uses tra- 
ditional wood framing in order to design a cavity into which the speaker enclo¬ 
sures can be designed and positioned. Extra bracing and heavy construction 
should be used to reduce resonances. 

FLOORS 

For many recording facilities, the isolation of floor-borne noises from room and 
building exteriors is an important consideration. For example, a building that's 
located on a busy Street and whose concrete floor is tied to the building's ground 
foundation might experience severe low-frequency rumble from nearby traffic. 
Alternatively, a second-floor facility might experience undue leakage from a 
noisy downstairs neighbor or, more likely, might interfere with a quieter neigh- 
bor's business. In each of these situations, increasing the isolation to reduce 
floor-borne leakage and/or transmission is essential. One of the most common 
ways to isolate floor-related noise is to construct a "floating" floor that is struc- 
turally decoupled from its subfloor foundation. 

Common construction methods for floating a professional facility's floor uses 
either neoprene "hockey puck" isolation mounts, U-Boat floor floaters (Figure 
3.10), or a continuous underlay, such as a rubberized floor mat. In these cases, 
the underlay is spread over the existing floor foundation and then covered with 
an overlaid plywood floor structure. In more extreme situations, this superstruc- 
ture could be covered with reinforcing wire mesh and finally topped with a 
4-inch layer of concrete (Figure 3.11). In either case, the isolated floor is then 
ready for carpeting, wood finishing, painting or any other desired surface. 
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Caulk this small gap at each layer 


1/2" or 5/8" Partida Board 


5/8" Drywall 
3/4" Particle Board 



Glue these layers together 
& screw them to the 


A framing members 


Insulate this cavity 


Framing member 
(2X4 shown) 

U-Boat(Duh) 


Existing wall 


SheetBIok 


FIGURE 3.10 

U-Boat™ floor beam 
float channels can 
be placed under a 
standard 2x4 floor 
frame to increase 
isolation. Floor 
floaters should be 
placed every 16 
inches under a 2x 
floor joist. 



FIGURE 3.11 

Basic guidelines for 
building a concrete 
floating floor using 
neoprene mounts. 


An even more cost- and space-effective way to decouple a floor involves layering 
the original floor with a rubberized or carpet foam pad. A 1/2- or 5/8-inch layer 
of tongue-and-groove plywood or oriented strand board (OSB) is then laid on 
top of the pad. These should not be nailed to the subfloor; instead, they can be 
stabilized by glue or by locking the pieces together with thin, metal braces. 
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Another foam pad can then be laid over this structure and topped with carpet- 
ing or any other desired finishing material (Figure 3.12). 

It is important for the floating superstructure to be isolated from both the 
underflooring and the outer wall. Failing to isolate these allows floor-borne 
sounds to be transmitted through the walls to the subfloor—and vice versa 
(often defeating the whole purpose of floating the floor). These wall perimeter 
isolation gaps can be sealed with pliable decoupling materials such as widths 
of soft mineral fiberboard, neoprene, silicone or other pliable materials. 


RISERS 



FIGURE 3.12 

An alternative, 
cost-effective way 
to float an existing 
floor by layering 
relatively inexpensive 
materials. 


As we saw from the equation TL = 14.5 log Mf- 16, low-frequency sound travels 
through barriers much more easily than does high-frequency sound. It stands 
to reason that strong, low-frequency energy is transmitted more easily than 
high-frequency energy between studio rooms, from the studio to the control 
room or to outside locations. In general, the drum set is most likely to be the 

biggest leakage offender. By decoupling much of 
a drum set's low-frequency energy from a studio 
floor, many of the low-frequency leakage prob- 
lems can be reduced. In most cases, the problem 
can be fixed by using a drum riser. Drum risers 
are available commercially (Figure 3.13), or they 
can be easily constructed. In order to reduce 
unwanted resonances, drum risers should be 
constructed using 2- x 6-inch or 2- x 8-inch 
beams for both the frame and the supporting 
joists (spaced at 16 or 12 inches on center, as 
shown in Figure 3.14). Sturdy 1/2- or 5/8-inch 
tongue-and-groove plywood panels should be 
glued to the supporting frames with carpenter's 
glue (or a similar wood glue) and then nailed or 


floor 


FIGURE 3.13 

HoverDeck™ 88 
isolation riser. 
(Courtesy of Auralex 
Acoustics, www. 
auralex.com.) 












Studio Acoustics and Design 


CHAPTER 3 O 



place carpet or rubber underneath 
joists for additional isolation 


FIGURE 3.14 

General construction 
details for a 
homemade drum 
riser. 


screwed down (using heavy-duty, galvanized fas- 
teners). When the frame has dried, rubber coaster 
float channels or (at the very least) strips of car- 
peting should be attached to the bottom of the 
frame ... and the riser will be ready for action. 

CEILINGS - 

Foot traffic and other noises from above a sound 
studio or production room are another common 
source of external leakage. Ceiling noise can be 

isolated in a number of ways. If foot traffic is - 

your problem and you're fortúnate enough to - 

own the floors above you, you can reduce this 

noise by simply carpeting the overhead hallway or by floating the upper floor. 
If you don't have that luxury, one approach to deadening ceiling-borne sounds 
is to hang a false structure from the existing ceiling or from the overhead joists 
(as is often done when a new room is being constructed). This technique can 
be fairly cost effective when "Z" suspensión channels are used (Figure 3.15). Z 
channels are often screwed to the ceiling joists to provide a flexible, yet strong 
support to which a hanging wallboard ceiling can be attached. If necessary, 
fiberglass or other sound-deadening materials can be placed into the cavities 
between the overhead structures. Other more expensive methods use spring 
support Systems to hang false ceilings from an existing structure. 



FIGURE 3.15 

Z channels can be 
used to hang a 
floating ceiling from 
an existing overhead 
structure. 


WINDOWS AND DOOR5 

Access to and from a studio or production room area (in the form of Windows 
and doors) can also be a potential source of sound leakage. For this reason, 
strict attention needs to be given to window and door design and construction. 
Visibility in a studio is extremely important within a music production environ- 
ment. For example, when múltiple rooms are involved, good visibility serves to 
promote effective communication between the producer or engineer and the 
studio musician (as well as among the musicians themselves). For this reason, 
Windows have been an important factor in studio design since the beginning. 
The design and construction details for a window often vary with studio needs 
and budget requirements and can range from being deep, double-plate cavities 
that are built into double-wall constructions (Figure 3.16) to more modest 
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JAMB DETAI L 




COVER GAP IN PLYWOOD WITH 6 
MIL POLYETHYELENE VAPOR 
BARRIERATALL SIDES. 

MAINTAIN 1/2" CLEARANCE 
BETWEEN ADJACENT 
FRAMES. 

HEADER AS REQUIRED, 


3/4* PLYWOOD FRAME. 

SET FRAME AND STOPS IN 
ACOUSTICAL SEALANT. 

_ XX-XX" A.F.F.v . 

T.O.TRIM^ 


1 1/2" xl 1/2 M WOOD STOP, 


V’ ACOUSTICAL 
TREATMENT WITH 
ACOUSTICAL FOAM AND 
BLACK FABRIC FACING. 


SOUND RATED LAMlNATED 
GLASS. THICKNESS AND 
MIN. STC RATING AS 
SCHEDULED, 


XX-XX" A.F.F,^ 
T.O, SILL^ 


COVER GAP |N PLYWOOD W|TH 6 
MIL POLYETHYELENE VAPOR 
BARRIERATALL SIDES. 


FIGURE 3.16 

Detail for a practical window construction between the control room and studio. (Courtesy of Russ Berger Design 
Group, Inc., www.rbdg.com.) 
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FIGURE 3.17 

A SoundSecure™ 
studio door. 

(Courtesy of Acoustic 
Systems, www. 
acousticsystems. 
com.) 


prefab designs that are built into a single wall. Other 
more expensive designs inelude floor-to-ceiling Windows 
that create a virtual "glass wall," as well as those that 
offer sweeping vistas, which are designed into poured sound lock 
concrete soffit walls. 

Access doors to and from the studio, control room, and 
exterior areas should be constructed of solid wood or 
high-quality acoustical materials (Figure 3.17), as solid 
doors generally offer higher TL valúes than their cheaper, 
hollow counterparts. No matter which door type is 
used, the appropriate seáis, weather-stripping, and 
doorjambs should be used throughout so as to reduce 
leakage through the cracks. Whenever possible, double- 
door designs should be used to form an acoustical sound FIGURE 3.18 

lock (Figure 3.18). This construction technique dramatically reduces leakage 
because the air trapped between the two solid barriers offers up high TL valúes. 

ISO-ROOMS AND ISO-BOOTHS 

Isolation rooms ( iso-rooms ) are acoustically isolated or sealed areas that are built 
into a music studio or just off of a control room (Figure 3.19). These recording 
areas can be used to sepárate louder instruments from softer ones (and vice 
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FIGURE 3.19 

Iso-room design at 
Studio Records, LLC, 
Ft. Worth, TX. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 



versa) in order to reduce leakage and to sepárate instrument types by volume 
to maintain control over the overall ensemble balance. For example: 

■ To elimínate leakage when recording scratch vocals (a guide vocal track 
that's laid down as a session reference), a vocalist might be placed in a 
small room while the rhythm ensemble is placed in the larger studio area. 

■ A piano or other instrument could be isolated from the larger area that's 
housing a full string ensemble. 

■ A B3 organ could be blaring away in an iso-room while backing vocals 
are being laid down in the main room. ... The possibilities are endless. 

An iso-room can be designed to have any number of acoustical properties. By 
having múltiple rooms and/or iso-room designs in a studio, several acoustical 
environments can be offered that range from being more reflective (live) to 
absorptive (dead) ... or a specific room can be designed to better fit the acous¬ 
tical needs of a particular instrument (e.g., drums, piano or vocals). These rooms 
can be designed as totally sepárate areas that can be accessed from the main 
studio or control room, or they might be directly tied to the main studio by 
way of sliding walls or glass sliding doors. In short, their form and function can 
be put to use to fit the needs and personality of the session. 

Isolation booths ( iso-booths ) provide the same type of isolation as an iso-room, 
but are often much smaller (Figure 3.20). Often called vocal booths, these mini- 
studios are perfect for isolating vocals and single instruments from the larger 
studio. In fact, rooms that have been designed and built for the express purpose 
of mixing down a recording will often only have an iso-booth ... and no other 
recording room. LJsing this space-saving option, vocals or single instruments 
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FIGURE 3.20 

Example of an 
iso-booth in action. 
(Courtesy of www. 
misheeddins.com.) 


can be easily overdubbed on site, and should more space be needed a larger 
studio can be booked to fit the bilí. 

NOISE ISOLATION WITHIN THE CONTROL ROOM 

Isolation between rooms and the great outdoors isn't the only noise-related 
issue in the modern-day recording or project studio. The proliferation of mul- 
titrack tape machines, computers, digital signal processors (DSPs) and cooling 
Systems has created issues that present their own Grinch-like types of noise, 
Noise, NOISE, NOISE!!! This usually manifests itself in the form of system fan 
noise, transport tape noise and Computer-related sounds from CPLJs, case fans, 
hard drives and the like. 

When it comes to isolating tape transport and system fan sounds, should budget 
and size constraints permit, it is often wise to build an iso-room or iso-closet 
that's been specifically designed and ventilated for containing such equipment. 
An equipment room that has easy-access doors that provide for current/future 
wiring needs can add a degree of peace-'n-quiet and an overall professionalism 
that will make both you and your clients happy. 

Within a smaller studio or project studio space, such a room isn't always pos- 
sible; however, with care and forethought the whizzes and whirrs of the digital 
era can be turned into a nonissue that you'll be proud of. 

Here are a few examples of the most common problems and their Solutions: 

■ Replace fans with quieter ones. By doing some careful Web searching or 
by talking to your favorite Computer salesperson, it's often possible to 
install CPU and case fans that are quieter than most off-the-shelf models. 
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■ Regúlate case fan speeds with variable pots. Gamers will often regúlate 
their Computer fans in order to keep noises down. Care needs to be taken 
to monitor the CPU/case temperatures so as not to harm your system. 

■ Install hard-drive enclosures to reduce internal hard-drive noises. These 
enclosures are designed to acoustically encapsulate 3.5-inch hard drives 
into a design that fits into a 5.25-inch hard-drive bay. 

■ Place the Computer in an acoustically isolated alcove. Again, care needs to 
be taken to monitor the CPU/case temperatures so as not to harm your 
system. 

■ Connect studio computers via a high-speed network to a remóte server 
location. By connecting a silent Computer (such as a small form factor PC 
or Mac laptop) to a central Computer via a high-speed network, not only 
is it possible to eliminate computer-related noises (by locating the Com¬ 
puter and drives in another room), but it's also a relatively simple matter 
to connect various production- and business-related termináis to a central 
server that can be backed up according to a schedule. 

ACOUSTIC PARTITIONS 

Movable acoustic partitions (also known as fíats or gobos ) are commonly used 
in studios to provide on-the-spot barriers to sound leakage. By partitioning 
a musician and/or instrument on one or more sides and then placing the 
mic inside the temporary enclosure, isolation can be greatly improved in a 
flexible way that can be easily changed as new situations arise. Acoustic parti¬ 
tions are currently available on the commercial market in various design styles 
and types for use in a wide range of studio applications (Figure 3.21). For those 
on a budget, or who have particular isolation needs, it's a relatively simple 
matter to get out the workshop tools and make your own fíats that are based 
around wood frames, fiberglass or other acoustically absorptive materials with 
your favorite colored fabric coverings—and ingenious craftsmanship (Figure 
3.22). 

If a fíat can't be found in sight, acoustic partitions can be easily built using 
common studio and household Ítems. For example, a simple partition can be 
easily made on the spot by grabbing a mic/boom stand combination and 
retracting the boom halfway at a 90° angle to make a T-shape. Simply drape a 
blanket or heavy coat over the T-bar and voilá—you've built a quick-'n-dirty 
dividing fíat. 

When using a partition, it's important to be aware of the musician's need to 
have good visibility with other musicians, the conductor and the producer. 
Musicality and human connectivity almost always take precedence over techni- 
cal issues. 

Symmetry in control room design 

While many professional studios are built from the ground up to standard 
acoustic and architectural guidelines, most budget-minded production and 
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FIGURE 3.21 

Acoustic partition fíat 
examples: (a) S5-2L 
“Sorber” baffle 
system (courtesy of 
ClearSonic Mfg., Inc., 
www.clearsonic. 
com); (b) piano panel 
setup (courtesy of 
Auralex Acoustics, 
www.auralex.com). 


project studios are often limited by their own unique sets of building, space 
and acoustic constraints. Even though the design of a budget, project or bedroom 
control room might not be acoustically perfect, if speakers are to be used in the 
monitoring environment, certain ground rules of acoustical physics must be 
followed in order to create a proper listening environment. 
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FIGURE 3.22 

Examples of a 
homemade fíat: 

(a) the “blanket 
and a boom” trick; 

(b) homemade fíat 
design. 



One of the most important acoustic design rules in a monitoring environment 
is the need for symmetrical reflections on all axes within the design of a control 
room or single-room project studio. In short, the center and acoustic imaging 
(ability to discriminate placement and balance in a stereo or surround field) is 
best when the listener, speakers, walls and other acoustical boundaries are sym- 
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metrically centered about the listener's position (often in an equilateral trian- 
gle). In a rectangular room, the best low-end response can be obtained by 
orienting the consolé and loudspeakers into the room's long dimensión (Figure 
3.23a). Should space or other room considerations come into play, centering 
the listener/monitoring position at a 45° angle within a symmetrical córner 



(a) 



FIGURE 3.23 

Various acceptable 
symmetries in a 
monitoring environ- 
ment: (a) Acoustic 
reflections must be 
symmetrical about 
the listener’s 
position. In addition, 
orienting a control 
room along the long 
dimensión can 
extend the room’s 
low-end response. 

(b) Placing the 
listening environment 
symmetrically in a 
córner is another 
example of how the 
left/right imagery can 
be improved over an 
off-center placement. 
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(Figure 3.23b) is another example of how the left/right imagery can be largely 
maintained. 

Should any primary boundaries of a control room (especially wall or ceiling 
boundaries near the mixing position) be asymmetrical from side to side, sounds 
heard by one ear will receive one combination of direct and reflected sounds, 
while the other ear will hear a different acoustic balance (Figure 3.24). This 
condition can drastically alter the sound's center image characteristics, so that 
when a sound is actually panned between the two monitor speakers the sound 
will appear to be centered; however, when the sound is heard in another studio 
or standard listening environment the imaging may be off center. To avoid 
this problem, care should be taken to ensure that both the side and ceiling 
boundaries are largely symmetrical with respect to each other and that all of 
the speaker level balances are properly set. 

While we're on the subject of the relationship between the room's acoustic 
layout and speaker placement, it's always wise to place nearfield and all other 
speaker enclosures at points that are equidistant to the listener in the stereo and 
surround field. Whenever possible, speaker enclosures should be placed 1 to 2 
feet away from the nearest wall and/or córner, which helps to avoid bass build- 
ups that acoustically occur at boundary and córner locations. In addition to 
strategic speaker placement, homemade or commercially available isolation 
pads (Figure 3.25) can be used to reduce resonances that often occur whenever 
enclosures are placed directly onto a table or fíat surface. 


FIGURE 3.24 

Placing the monitor- 
ing environment 
off-center and in a 
córner will affect the 
audible center image, 
and placing one 
speaker in a 90° 
córner can cause an 
off-center bass 
buildup and 
adversely affect the 
mix’s imagery. 
Shifting the listener/ 
monitoring position 
into the center will 
greatly improve the 
left/right imagery. 
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FIGURE 3.25 

Speaker isolation pads can help to reduce speaker/stand resonances. (a) Auralex MoPAD™ speaker isolation pad 
(courtesy of Auralex Acoustics, www.auralex.com); (b) Primacoustic Recoil Stabilizer pad (courtesy of Primacoustic, 
www.primacoustic.com). 


Frequency balance 

Another important factor 
in room design is the need 
for maintaining the origi¬ 
nal frequency balance of an 
acoustic signal. In other 
words, the room should 
exhibit a relatively fíat fre¬ 
quency response over the entire audio range without adding its own particular 
sound coloration. The most common way to control the tonal character of a 
room is to use materials and design techniques that govern the acoustical reflec- 
tion and absorption factors. 



FIGURE 3.26 

Sound reflects off a 
surface at an angle 
equal (and opposite) 
to its original angle 
of incidence, much 
as light will reflect 
off a mirror. 


REFLECTIONS 

One of the most important characteristics of sound as it travels through air is 
its ability to reflect off a boundary's surface at an angle that's equal to (and 
opposite of) its original angle of incidence (Figure 3.26). Just as light bounces 
off a mirrored surface or múltiple reflections can appear within a mirrored 
room, sound reflects throughout room surfaces in ways that are often amazingly 
complex. Through careful control of these reflections, a room can be altered to 
improve its frequency response and sonic character. 
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In Chapter 2, we learned that sonic reflections can be controlled in ways that 
disperse the sound outward in a wide-angled pattern (through the use of a 
convex surface) or focus them on a specific point (through the use of a concave 
surface). Other surface shapes, on the other hand, can reflect sound back at 
various other angles. For example, a 90° córner will reflect sound back in the 
same direction as its incident source (a fact that accounts for the additive acous- 
tic buildups at various frequencies at or near a wall-to-corner or corner-to-floor 
intersection). 

The all-time winner of the "avoid this at all possible costs" award goes to con- 
structions that inelude opposing parallel walls in its design. Such conditions 
give rise to a phenomenon known as standing waves. Standing waves (also 
known as room modes) occur when sound is reflected off of parallel surfaces 
and travels back on its own path, thereby causing phase differences to interfere 
with a room's amplitude response (Figure 3.27). Room modes are expressed as 
integer múltiples of the length, width and depth of the room and indicate which 
múltiple is being referred to for a particular reflection. 

Walking around a room with modérate to severe mode problems produces the 
sensation of increasing and/or decreasing volume levels at various frequencies 
throughout the area. These perceived volume changes are due to amplitude 
(phase) cancellations and reinforcements of the combined reflected waveforms 
at the listener's position. The distance between parallel surfaces and the signal's 
wavelength determines the nodal points that can potentially cause sharp peaks 


FIGURE 3.27 

Standing waves 
within a room with 
reflective parallel 
surfaces can 
potentially cancel 
and reinforce 
frequencies within 
the audible spec- 
trum, causing 
changes in its 
response. 


side 

standing 
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or dips at various points in the response curve (up 
to or beyond 19 dB) at the affected fundamental 
frequency (or frequencies) and upper harmonio 
intervals (Figure 3.28). This condition exists not only 
for opposing parallel walls but also for all parallel 
surfaces (such as between the floor and ceiling or 
between two reflective fíats). From this discussion, 
it's obvious that the most effective way to prevent 
standing waves is to construct walls, boundaries and 
ceilings that are nonparallel. 

If the room in question is rectangular or if further 
sound-wave dispersión is desired, diffusers can be 
attached to the wall and/or ceiling boundaries to 
help breakup standing waves. Diffusers (Figure 3.29) 
are acoustical boundaries that reflect the sound wave 
back at various angles that are wider than the origi¬ 
nal incident angle (thereby breaking up the energy- 
destructive standing waves). In addition, the use of 
both nonparallel and diffusion wall construction can 
reduce extreme, recurring reflections and smooth out 
the reverberation characteristics of a room by build- 
ing more complex acoustical pathways. 


front reflective wall rear reflective wall 





Flutter echo (also called slap echo ) is a condition that 
occurs when parallel boundaries are spaced far 
enough apart that the listener is able to discern a 
number of discrete echoes. Flutter echo often pro¬ 



duces a "boingy," hollow sound that greatly affects a 

room's sound character as well as its frequency response. A larger room (which 
might contain delayed echo paths of 50 msec or more) can have its echoes 
spaced far enough apart in time that the discrete reflections produce echoes that 
actually interfere with the intelligibility of the direct sound, often resulting in 
a jumble of noise. In these cases, the proper application of absorption and 
acoustic dispersión becomes critical. 

When speaking of reflections within a studio control room, one long-held 
design concept relates to the concept of designing the room such that the rear 
of the room is largely reflective and diffuse in nature (acoustically "Uve"), while 
the front of the room is largely or partially absorptive (acoustically "dead"). 
This philosophy (Figure 3.30) argües for the fact that the rear of the room 
should be largely reflective (Figure 3.31), providing for a balanced environment 
that can help reinforce positive reflections that can add acoustic "life" to the 
mix experience. The front of the room would tend more toward the absorptive 
side in a way that would reduce standing-wave and flutter reflections that would 
interfere with the overall response of the room. 


FIGURE 3.28 

The reflective, 
parallel walls create 
an undue number 
of standing waves, 
which occur at 
various frequency 
intervals (í, f 2 , h, f 4 , 
and so on). 
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FIGURE 3.29 

Commercial diffuser 
examples: (a) 
T’Fusor™ sound 
diffusers (courtesy of 
Auralex Acoustics, 
www.auralex.com); 

(b) SpaceArray sound 
diffusers (courtesy of 
pArtScience, www. 
partscience.com); 

(c) open-ended view 
of a Primacoustic™ 
Razorblade quadratic 
diffuser (courtesy of 
Primacoustic Studio 
Acoustics, www. 
primacoustic.com). 
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FIGURE 3.29 

Continued 



FIGURE 3.30 

Control-room layout 
showing the live end 
toward the back of 
the room and the 
dead end toward the 
front of the room. 


It’s ¡mportant to realize that no two rooms will be acousti- 
cally the same or will necessarily offer the same design 
challenges. The one constant is that careful planning, 
solid design and ingenuity are the foundation of any good 
sounding room. You should also keep ¡n mind that 


numerous studio design and commercial acoustical 
product firms are available that offer assistance for both 
large and small projects. Getting professional advice is a 
good thing. 
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FIGURE 3.31 

Placing bookshelves 
along the rear wall 
can provide both 
diffusion and a place 
for lots of storage. 



direct sound 


reflected sound 



absorbed sound 


FIGURE 3.32 


ABSORPTION 

Another factor that often has a marked effect on 
an acoustic space involves the use of surface 
materials and designs that can absorb unwanted 
sounds (either across the entire audible band or 
at specific frequencies). The absorption of acoustic 
energy is, effectively, the inverse of reflection 
(Figure 3.32). Whenever sound strikes a mate¬ 
rial, the amount of acoustic energy that's 
absorbed relative to the amount that's reflected 
can be expressed as a simple ratio known as the 
material's absorption coefficient. For a given mate¬ 
rial, this can be represented as: 


Absorption occurs 
when only a portion 
of the incident 
acoustic energy is 
reflected back from a 
material’s surface. 


A = I a /I r 

where 7 a is the sound level (in dB) that is absorbed by the surface (often dis- 
sipated in the form of physical heat), and 7 r is the sound level (in dB) that is 
reflected back from the surface. 


The factor (1 - a) is a valué that represents the amount of reflected sound. This 
makes the coefficient a decimal percentage valué between 0 and 1. If we say that 
a surface material has an absorption coefficient of 0.25, we're actually saying 
that the material absorbs 25% of the original acoustic energy and reflects 75% 
of the total sound energy at that frequency. A sample listing of these coefficients 
is provided in Table 3.2. 










Studio Acoustics and Design ijAPTII 


Table 3.2 Absorption coefficients for various materials. 

Coefficients (Hz) 


Material 

125 

250 

500 

1000 

2000 

4000 

Brick, unglazed 

0.03 

0.03 

0.03 

0.04 

0.05 

0.07 

Carpet (heavy, on concrete) 

0.02 

0.06 

0.14 

0.37 

0.60 

0.65 

Carpet (with látex backing, on 
40-oz hair-felt or foam rubber) 

0.03 

0.04 

0.11 

0.17 

0.24 

0.35 

Concrete or terrazzo 

0.01 

0.01 

0.015 

0.02 

0.02 

0.02 

Wood 

0.15 

0.11 

0.10 

0.07 

0.06 

0.07 

Glass, large heavy píate 

0.18 

0.06 

0.04 

0.03 

0.02 

0.02 

Glass, ordinary window 

0.35 

0.25 

0.18 

0.12 

0.07 

0.04 

Gypsum board nailed to 2 x 4 
studs on 16-inch centers 

0.013 

0.015 

0.02 

0.03 

0.04 

0.05 

Plywood (3/8 inch) 

0.28 

0.22 

0.17 

0.09 

0.10 

0.11 

Air (sabins/1000 ft 3 ) 

— 

— 

— 

— 

2.3 

7.2 

Audience seated in 
upholstered seats 

0.08 

0.27 

0.39 

0.34 

0.48 

0.63 

Concrete block, coarse 

0.36 

0.44 

0.31 

0.29 

0.39 

0.25 

Light velour (10 oz/yd 2 in 
contact with wall) 

0.29 

0.10 

0.05 

0.04 

0.07 

0.09 

Plaster, gypsum, or lime 
(smooth finish on tile or brick) 

0.44 

0.54 

0.60 

0.62 

0.58 

0.50 

Wooden pews 

0.57 

0.61 

0.75 

0.86 

0.91 

0.86 

Chairs, metal or wooden, seats 

0.15 

0.19 

0.22 

0.39 

0.38 

0.30 


unoccupied 

Note: These coefficients were obtained by measurements in the laboratories of the Acoustical Materials Association. 
Coefficients for other materials may be obtained from Bulletin XXII of the association. 

To determine the total amount of absorption that's obtained by the sum of all 
the absorbers within a total volume area, it's necessary to calcúlate the average 
absorption coefficient for all of the surfaces together. The average absorption coef- 
ficient (A ave ) of a room or area can be expressed as: 

A ave = (^ 1^1 S 2 a 2 + • • • + S n a n )/S 

where s v s 2 , ... , s n are the individual surface areas; a lr a 2 , ... , a n are the indi¬ 
vidual absorption coefficients of the individual surface areas; and S is the total 
square surface area. 
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On the subject of absorption, one common misconception is that the use of 
large amounts of sound-deadening materials will reduce room reflections and 
therefore make a room sound "good." In fact, the overuse of absorption will 
often have the effect of reducing high frequencies, creating a skewed room 
response that is dull and bass-heavy—as well as reducing constructive room 
reflections that are important to a properly designed room. In fact, with regard 
to the balance between reflection, diffusion and absorption, many designers 
agree that a balance of 25% absorption and 25% diffuse reflections is a good 
ratio that can help preserve the "life" of a room, while reducing unwanted 
buildups. 

High-frequency absorption 

The absorption of high frequencies is accomplished through the use of dense 
porous materials, such as fiberglass, dense fabric and carpeting. These materials 
generally exhibit high absorption valúes at higher frequencies, which can be 
used to control room reflections in a frequency-dependent manner. Specially 
designed foam and acoustical treatments are also commercially available that 
can be attached easily to recording studio, production room or control-room 
walls as a means of taming múltiple room reflections and/or dampening high- 
frequency reflections (Figure 3.33). 

Low-frequency absorption 

As shown in Table 3.2, materials that are absorptive in the high frequencies 
often provide little resistance to the low-frequency end of the spectrum (and 
vice versa). This occurs because low frequencies are best damped by pliable 
materials, meaning that low-frequency energy is absorbed by the material's 
ability to bend and flex with the incident waveform (Figure 3.34). Rooms that 
haven't been built to the shape and dimensions to properly handle the low end 
will need to be controlled by using bass traps that are tuned to reduce the room's 
resonance frequencies. 

Another absorber type can be used to reduce low-frequency buildup at specific 
frequencies (and their múltiples) within a room. This type of attenuation device 
(known as a bass trap ) is available in a number of design types: 

■ Quarter-wavelength trap 

■ Pressure-zone trap 

■ Functional trap. 

The quarter-wavelength trap The quarter-wavelength bass trap (Figure 3.35) 
is an enclosure with a depth that's one-fourth the wavelength of the offending 
frequency's fundamental frequency and is often built into the rear facing wall, 
ceiling or floor structure and covered by a metal grating to allow foot traffic. 
The physics behind the absorption of a calculated frequency (and many of the 
harmonics that fall above it) rests in the fact that the pressure component of a 
sound wave will be at its máximum at the rear boundary of the trap ... when 
the wave's velocity component is at a minimum. At the mouth of the bass trap 
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FIGURE 3.33 

Commercial 
absorption examples: 
(a) various absorption 
and diffusion wall 
treatments; (b) studio 
showing foam 
absorption treatment 
(courtesy of Auralex 
Acoustics, www. 
auralex.com). 


(which is at a one-fourth wavelength distance from this rear boundary), the 
overall acoustic pressure will be at its lowest, while the velocity component 
(molecular movement) will be at its highest potential. Because the wave's mo- 
tion (forcé) is greatest at the trap's opening, much of the signal can be absorbed 
by placing an absorptive material at that opening point. A low-density fiberglass 
lining can also be placed inside the trap to increase absorption (especially at 
harmonic intervals of the calculated fundamental). 

Pressure-zone trap The pressure-zone bass trap absorber (Figure 3.36) works 
on the principie that sound pressure is doubled at large boundary points that are 
at 90° angles (such as walls and ceilings). By placing highly absorptive material 
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FIGURE 3.34 

Low-frequency absorption. (a) A carefully designed pliable surface can be used to absorb low frequencies. 

(b) Primacoustic™ Polyfuser, a combination diffuser and bass trap (courtesy of Primacoustic Studio Acoustics, www. 
primacoustic.com). 




(a) 


FIGURE 3.35 

A quarter-wavelength bass trap: (a) physical concept design; (b) sound is largely absorbed as heat, since the particle 
velocity (motion) is greatest at the trap’s quarter-wavelength opening. 
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FIGURE 3.36 

LENRD™ bass traps. (Courtesy of Auralex Acoustics, 
www.auralex.com.) 

at a boundary point (or points, in the case of a corner/ceiling intersection), the 
built-up pressure can be partially absorbed. 


FIGURE 3.37 

A functional bass 
trap that has been 
placed in a córner to 
prevent bass buildup. 


Functional trap Originally created in the 1950s by Harry F. Olson (former di¬ 
rector of RCA Labs), the functional bass trap (Figure 3.37) uses a material gener- 
ally formed into a tube or half-tube structure that is rigidly supported so as to 
reduce structural vibrations. By placing these devices into corners, room bound- 
aries or in a freestanding spot, a large portion of the undesired bass buildup fre- 
quencies can be absorbed. By placing a reflective surface over the portion of the 
trap that faces into the room, frequencies above 400 Hz can be dispersed back 
into the room or focal point. Figure 3.38 shows how these traps can be used in 
the studio to break up reflections and reduce bass buildup. 


ROOM REFLECTIONS AND ACOUSTIC 
REVERBERATION 

Another criterion for studio design is the need for a desirable room ambience 
and intelligibility, which is often contradictory to the need for good acoustic 
separation between instruments and their pickup. Each of these factors is gov- 
erned by the careful control and tuning of the reverberation constants within 
the studio over the frequency spectrum. 

Reverberation ( reverb ) is the persistence of a signal (in the form of reflected waves 
within an acoustic space) that continúes after the original sound has ceased. 
The effect of these closely spaced and random múltiple echoes gives us percep¬ 
tible cues as to the size, density and nature of an acoustic space. Reverb also 
adds to the perceived warmth and spatial depth of recorded sound and plays 
an extremely important role in the perceived enhancement of music. 


Room Reflections and Acoustic Reverberation 




FIGURE 3.38 

Quick Sound Field. 
(Courtesy of Acoustic 
Sciences Corporation, 
www.tubetrap.com.) 


As was stated in the latter part of Chapter 2, 
the reverberated signal itself can be broken 
down into three components: 

■ Direct sound 

■ Early reflection 

■ Reverb. 

The direct signal is made up of the original, inci- 
dent sound that travels from the source to the 
listener. Early reflections consist of the first few 
reflections that are projected to the listener ofif of 

4 1 major boundaries within an acoustic space; these 

reflections generally give the listener subcon- 
scious cues as to the size of the room. (It should 
i J be noted that strong reflections off of large, 

| nearby surfaces can potentially have detrimental 

cancellation effects that can degrade a room's 
sound and frequency response at the listening 
position.) The last set of signal reflections makes 
up the actual reverberation characteristic. These 
signáis are composed of random reflections that 
travel from boundary to boundary in a room and 
are so closely spaced that the brain can't discern 
the individual reflections. When combined, they are perceived as a single decay- 
ing signal. 

Technically, reverb is considered to be the time that's required for a sound to 
die away to a millionth of its original intensity (resulting in a decrease over time 
of 60 dB), as shown by the following formula: 



RT 60 = VxO.049/AS 


where RT is the reverberation time (in sec), V is the volume of the enclosure 
(in ft 3 ), A is the average absorption coefficient of the enclosure, and S is the 
total surface area (in ft 2 ). As you can see from this equation, reverberation time 
is directly proportional to two major factors: the volume of the room and the 
absorption coefficients of the studio surfaces. A large environment with a rela- 
tively low absorption coefficient (such as a large cathedral) will have a relatively 
long RT 60 decay time, whereas a small studio (which might incorpórate a heavy 
amount of absorption) will have a very short RT 60 . 

The style of music and the room application will often determine the optimum 
RT 60 for an acoustical environment. Reverb times can range from 0.25 sec in a 
smaller absorptive recording studio environment to 1.6 sec or more in a larger 
music or seo ring studio. In certain designs, the RT 60 of a room can be altered 
to fit the desired application by using movable panels or louvers or by placing 
carpets in a room. Other designs might sepárate a studio into sections that 
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exhibit different reverb constants. One side of the studio (or sepárate iso-room) 
might be relatively nonreflective or dead, whereas another section or room 
could be much more acoustically live. The more reflective, live section is often 
used to bring certain Instruments that rely heavily on room reflections and 
reverb, such as strings or an acoustic guitar, to "Ufe." The recording of any 
number of instruments (including drums and percussion) can also greatly 
benefit from a well-designed acoustically live environment. 

Isolation between different instruments and their pickups is extremely impor- 
tant in the studio environment. If leakage isn't controlled, the room's effective- 
ness becomes severely limited over a range of applications. The studio designs 
of the 1960s and 1970s brought about the rise of the "sound sucker" era in 
studio design. During this time, the absorption coefficient of many rooms was 
raised almost to an anechoic (no reverb) condition. With the advent of the 
music styles of the 1980s and a return to the respectability of live studio acous¬ 
tics, modern studio and control-room designs have begun to increase in size 
and "liveness" (with a corresponding increase in the studio's RT 60 ). This has 
reintroduced the buying public to the thick, live-sounding music production of 
earlier decades, when studios were larger structures that were more attuned to 
capturing the acoustics of a recorded instrument or ensemble. 

ACOUSTIC ECHO CHAMBERS 

Another physical studio design that was used extensively in the past (before the 
invention of artificial effects devices) for re-creating room reverberation is the 
acoustic echo chamber. A traditional echo chamber is an isolated room that has 
highly reflective surfaces into which speakers and microphones are placed. The 
speakers are fed from an effects send, while the mic's reverberant pickup is fed 
back into the mix via an input strip of effects return. By using one or more 
directional mies that have been pointed away from the room speakers, the direct 
sound pickup can be minimized. Movable partitions also can be used to vary 
the room's decay time. When properly designed, acoustic echo chambers have 
a very natural sound quality to them. The disadvantage is that they take up space 
and require isolation from external sounds; thus, size and cost often make it 
unfeasible to build a new echo chamber, especially those that can match the 
caliber and quality of high-end digital reverb devices. 

An echo chamber doesn't have to be an expensive, built-from-the-ground-up 
design. Actually, a temporary chamber can be made from a wide range of 
acoustic spaces to pepper your next project with a bit of "acoustic spice." For 
example: 

■ An ambient-sounding chamber can be built by placing a Blumlein (crossed 
figure-8) pair or spaced stereo pair of mies in the main studio space and 
feeding a send to the studio playback monitors (Figure 3.39). 

■ A speaker/mic setup could be placed in an empty garage (as could a guitar 
amp/mic, for that matter). 

■ An empty stairwell often makes an excellent chamber. 
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FIGURE 3.39 

Example of how a 
room or studio space 
can be used as a 
temporary echo 
chamber. 



■ Any vocalist could tell you what'll happen if you place a singer or guitar 
speaker/mic setup in the shower. 

From the above, it's easy to see that ingenuity and experimentation are often 
the ñame of the makeshift chamber game. In fact, there's nothing that says that 
the chamber has to be a real-time effect ... for example, you could play back a 
song's effects track from a laptop DAW into a church's acoustic space and record 
the effect back to stereo tracks on the DAW. The options and limitless experi- 
mentations are totally up to you! 




CHAPTER 4 


Microphones: Design 
and Application 


THE MICROPHONE: AN INTRODUCTION 

A microphone (often called a mic ) is usually the first device in a recording chain. 
Essentially, a mic is a transducer that changes one form of energy (sound waves) 
into another corresponding form of energy (electrical signáis). The quality of 
its pickup will often depend on external variables (such as placement, distance, 
instrument and the acoustic environment), as well as on design variables 
(such as the microphones operating type, design characteristics and quality). 
These interrelated elements tend to work together to affect the overall sound 
quality. 

In order to deal with the wide range of musical, acoustic and situational cir- 
cumstances that might come your way (not to mention your own personal 
taste), a large number of mic types, styles and designs can be pulled out of our 
"sonic toolbox." Because the particular characteristics of a mic might be best 
suited to a specific range of applications, engineers and producers use their 
artistic talents to get the best possible sound from an acoustic source by carefully 
choosing the mic or mies that fit the specific pickup application at hand. 

The road to considering microphone choice and placement is best traveled by 
considering a few simple rules: 


Rule 1: There are no rules , only guidelines. Although guidelines can help you 
achieve a good pickup, don't hesitate to experiment in order to get a 
sound that best suits your needs or personal taste. 

Rule 2: The overall sound of an audio signal is no better than the weakest link in 
the signal path. If a mic or its placement doesn't 
sound as good as it could, make the changes 
to improve it BEFORE you commit it to 
tape, disc or whatever. More often than 
not, the concept of "fixing it later in the 
mix" will often put you in the unfortu- 
nate position of having to correct a situ- 
ation after the fact, rather than recording 
the best sound and/or performance during 
the initial session. 


THE "GOOD RULE" 

Good musician + good instrument 
+ good performance + good acoustics 
+ good mic + good placement 
= good sound. 
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Rule 3: Whenever possible, use the "Good Rule": Good musician + good instrument 
+ good performance + good acoustics + good mike + good placement = good 
sound. This rule refers to the fact that a music track will only be as good 
as the performer, instrument, mic placement and the mic itself. If any of 
these elements falls short of its potential, the track will suffer accordingly. 
However, if all of these links are the best that they can be, the recording 
will almost always be something that you'll be proud of! 

The miking of vocals and instruments (both in the studio and onstage) is defi- 
nitely an art form. It's often a balancing act to get the most out of the Good 
Rule. Sometimes you'll have the best of all of the elements; at others, you'll have 
to work hard to make lemonade out of a situational lemon. The best rule of all 
is to use common sense and to trust your instincts. 

Before delving into placement techniques and facts that deal with the finer 
points of microphone technology, I'd like to take a basic look at how micro- 
phones (and their operational characteristics) work. Why do I put this in the 
book? Well, from a personal standpoint, having a basic understanding of what 
happens "under the hood" has helped me to get a mental image of how a par¬ 
ticular mic or mic technique will work in a certain situation. Basically it helps 
to make judgments that can be combined with my own intuition to make the 
best artistic judgment at the time ... I hope it will help you, as well. 

MICROPHONE DESIGN 

A microphone is a device that converts acoustic energy into corresponding 
electrical voltages that can be amplified and recorded. In audio production, 
three transducer mic types are used: 

■ Dynamic mic 

■ Ribbon mic 

■ Condenser mic. 


THEORY OF 

ELECTROMAGNETIC INDUCTION 

Whenever an electrically conductive 
metal cuts across the flux lines of a magnetic 
field, a current of a specific magnitude and 
direction ujíII be generated ujithin 
that metal. 


The dynamic microphone 

In principie, the dynamic mic (Figure 4.1) 
operates by using electromagnetic induc- 
tion to generate an output signal. The 
simple theory of electromagnetic induction 
States that whenever an electrically con¬ 
ductive metal cuts across the flux lines 
of a magnetic field, a current of a specific 
magnitude and direction will be generated 
within that metal. 


Dynamic mic designs (Figure 4.2) generally consist of 
a stiff Mylar diaphragm of roughly 0.35-mil thickness. Attached to the dia- 
phragm is a finely wrapped core of wire (called a voice coil) that's precisely 
suspended within a high-level magnetic field. Whenever an acoustic pressure 
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FIGURE 4.1 

The Shure Beta 58A 
dynamic mic. 
(Courtesy of Shure 
Incorporated, www. 
shure.com.) 



magnetic 

housing 

moving 
coi! 

output 
leads 



FIGURE 4.2 

Inner workings of a dynamic microphone. 


wave hits the diaphragm's face (A), the attached 
voice coil (B) is displaced in proportion to the 
amplitude and frequency of the wave, causing 
the coil to cut across the lines of magnetic flux 
that's supplied by a permanent magnet (C). In 
doing so, an analogous electrical signal (of a 
specific magnitude and direction) is induced into 
the coil and across the output leads, thus produc- 
ing an analog audio output signal. 



The ribbon microphone 

Like the dynamic microphone, the ribbon mic also works on the principie of 
electromagnetic induction. Older ribbon design types, however, use a diaphragm 
of extremely thin aluminum ribbon (2 microns). Often, this diaphragm is cor- 
rugated along its width and is suspended within a strong field of magnetic flux 
(Figure 4.3). Sound-pressure variations between the front and the back of the 
diaphragm cause it to move and cut across these flux lines, inducing a current 
into the ribbon that's proportional to the amplitude and frequency of the 
acoustic waveform. Because the ribbon generates a small output signal (when 
compared to the larger output that's generated by the múltiple wire turns of a 
moving coil), its output signal is too low to drive a microphone input stage 


FIGURE 4.3 

Cutaway detaiI of a 
ribbon microphone. 
(Courtesy of Audio 
Engineering 
Associates, www. 
ribbonmics.com.) 
















































Microphone Des gn 


FIGURE 4.4 

The AEA A440 ribbon 
mic. (Courtesy of 
Audio Engineering 
Associates, www. 
ribbonmics.com.) 





FIGURE 4.6 

The Beyerdynamic MI 60 ribbon mic. (Courtesy of Beyerdynamic, www. 
beyerdynamic.com.) 


directly; thus, a step-up transformer must be used to boost the 
output signal and impedance to an acceptable range. 

Until recently, traditional ribbon technology could be only found 
on the original, vintage mies (such as the older RCA and Colé 
ribbon mies); however, with the skyrocketing price of vintage mies 
and a resurgence in the popularity of the smooth, transient quality 
of the "ribbon sound," newer mies that follow the traditional 


FIGURE 4.5 

Cascade 

Microphones Fat 
Head ribbon mic. 
(Courtesy of Cascade 
Microphones, www. 
cascademicrophones. 
com.) 


design philosophies have begun to spring up on the market 
(Figures 4.4 and 4.5). 

RECENT DEVELOPMENTS IN RIBBON TECHNOLOGY 

During the past several decades, certain microphone manufacturers have 
made changes to original ribbon technologies by striving to miniaturize and 
improve their basic operating characteristics. The popular MI60 (Figure 4.6) 
and M260 ribbon mies from Beyerdynamic use a rare-earth magnet to produce 
a capsule that's small enough to fit into a 2-inch grill ball (much smaller than 
a traditional ribbon-style mic). The ribbon (which is corrugated along its length 
to give it added strength and at each end to give it flexibility) is 3 microns thick, 
about0.08 inchwide, 0.85 inch long and weighs only 0.000011 ounce. A plástic 
throat is fitted above the ribbon, which houses a pop-blast filter. Two additional 
filters and the grill greatly reduce the ribbon's potential for blast and wind 
damage, a feature that has made these designs suitable for outdoor and 
handheld use. 
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Another relatively recent advance in ribbon technology has been the develop- 
ment of the printed ribbon mic. In principie, the printed ribbon operates in 
precisely the same manner as the conventional ribbon pickup; however, the 
rugged diaphragm is made from a polyester film that has a spiral aluminum 
ribbon printed onto it. Ring magnets are then placed at the diaphragm's front 
and back, thereby creating a wash of magnetic flux that makes the electromag- 
netic induction process possible. 

Other alterations to traditional ribbon technology make use of phantom power 
to supply power to an active, internal amplifier (Figure 4.6), so as to boost the 
mic's output to that of a dynamic or condenser mic, without the need for a 
passive transformer (an explanation of phantom power can be found in the 
next section on condenser mies). 


The condenser microphone 



Condenser mies (like the capsules which are shown in Figures 4.7 and 4.8) 
opérate on an electrostatic principie rather than the electromagnetic principie 
used by a dynamic or ribbon mic. The capsule of a basic condenser mic 
consists of two plates: one very thin movable diaphragm and one fixed back- 
plate. These two plates form a capacitor (or condenser, as it is still called in the 
UK and in many parís of the world). A capacitor is an electrical device that's 
capable of storing an electrical charge. The 
amount of charge that a capacitor can store is 
determined by its capacitance valué and the 
voltage that's applied to it, according to the 
formula: 


FIGURE 4.7 

Exposed example 
of a condenser 
diaphragm. (Courtesy 
of ADK, www. 
adkmic.com; 
photograph by K. 
Bujack.) 


1 Q = CV 


where Q is the charge (in coulombs), C is the 
capacitance (in farads), and V is the voltage (in 
volts). 


At its most basic level, a condenser mic 
operates when a regulated DC power supply 
is applied between its diaphragm plates to create 
a capacitive charge. When sound acts upon 
the movable diaphragm, the varying distance 
between the plates will likewise create a change 
in the device's capacitance (Figure 4.9). Accord¬ 
ing to the above equation, if Q (the power supply 
charge) is constant and C (the diaphragm's 
capacitance) changes, then V (voltage across the 
diaphragm) will change in a proportional and 
inverse fashion. In other words: 
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FIGURE 4.8 

Inner detail of an 
AKG C3000B 
condenser mic. 
(Courtesy of AKG 
Acoustics, Inc., www. 
akg.com.) 



FIGURE 4.9 

Output and potential 
relationships as a 
result of changing 
capacitance. 



C is constant 
V is constant 


C decreases 
V increases 


C increases 
V decreases 


Since the charge (Q) is known to be constant and the 
diaphragm’s capacitance (C) changes with differences in 
sound pressure, the voltage (V) must change in inverse 


proportion. Given that the capsule’s voltage now changes 
in proportion to the sound waves that act upon ¡t, voilá ... 
we have a condenser mic! 


The next trick is to tap into the Circuit to capture the changes in output voltage, 
by placing a high-value resistor across the Circuit. Since the voltage across the 
resistor will change in inverse proportion to the capacitance across the capsule 
plates, this signal will then become the mic's output signal (Figure 4.10). 
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Since the resulting signal has an extremely high imped- 
ance, it must be fed through a preamplifier in order to 
preserve the mic's frequeney response characteristics. 

Since this amp must be placed at a point just following 
the resistor (often at a distance of 2 inches or less), it is 
almost always placed within the mic's body in order to 
prevent hum, noise pickup and signal-level losses. In 
addition to the need for a polarizing voltage, the preamp 
is another reason why conventional condenser micro¬ 
phones require a supply voltage in order to opérate. 

PHANTOM POWER 

Most modern professional condenser (and some ribbon) 
mies don't require internal batteries, external battery 
packs or individual AC power supplies in order to opérate. 

Instead, they are designed to be powered directly from the 
consolé through the use of a phantom power supply. 

Phantom power works by supplying a positive DC supply 
voltage of +48 V through both audio conductors (pins 2 
and 3) of a balanced mic line to the condenser capsule and preamp. This voltage 
is equally distributed through identical valué resistors, so that no differential 
exists between the two leads. The -48-V side of the Circuit is supplied to the 
capsule and preamp through the cables grounding wire (pin 1). 

Since the audio is only affected by potential differences between pins 2 and 3 
(and not the ground signal on pin 1), the carefully matched +48-V potential at 
these leads is therefore not electrically "visible" to the input stage of a balanced 
mic preamp. Instead, only the balanced, alternating audio signal that's being 
simultaneously carried along the two audio leads will be detected (Figure 4.11). 


d -Ad 


voltage ¿ 
source 


Av 


-• 

output - 

FIGURE 4.10 

As a sound wave 
decreases the 
condenser spacing 
by d the capacitance 
will increase, causing 
the voltage to 
proportionately fall 
(and vice versa). 



The resistors (R) used for distributing power to the signal leads should be 1/4-W 
resistors with a ±1% tolerance and have the following valúes for the following 
supply voltages (because some mies can also be designed to work at voltages 
lower than 48 V): 6.8 kQ for 48 V, 1.2 kQ for 24 V, and 680 Q for a 12-V supply. 
In addition to precisely matching the supply voltages, these resistors also provide 
a degree of power isolation between other mic inputs on a consolé. If a signal 
lead were accidentally shorted to ground (which could happen if defective 
cables or unbalanced XLR cables were used), the power supply should still be 
able to deliver power to other mies in the system. If two or more inputs were 
accidentally shorted, however, the phantom voltage could drop to levels that 
would be too low to be usable. 


Although most modern condensers use some form of a field effect transistor 
(FET) to reduce the capsule impedance, an increasing number of original era 
and "revival" models use an internally housed vacuum tube to amplify and 
change the impedance of the condenser capsule. These mies are generally valued 
by studios and collectors alike for their "tube" sound, which results from 
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+48V center tap off of the transformer 
(or equivalen! transformerless Circuit) 



to powcr capsule and/or preamp 


ground for 48V 
phantom supply 


FIGURE 4.11 

Schematic drawing 
of a phantom power 
system. 


even-harmonic distortion and other sonic characteristics that occur whenever 
tubes are used. 


THE ELECTRET-CONDENSER MICROPHONE 

Electret-condenser mies work on the same operating principies as their externally 
polarized counterparts, with the exception that a static polarizing charge has 
been permanently set up between the mic's diaphragm and its backplate. Since 
the charge (Q) is built into the capsule, no external source is required to power 
the diaphragm. However, as with a powered condenser mic, the capsule's output 
impedance is so high that a preamp will still be required to reduce it to a stan¬ 
dard valué. As a result, a battery, external powering source or standard phantom 
supply must be used to power the low-current amp. 


D I Y 

do it yourself 


1. Go to the tutorial section of www.modrec.com, click 
on “Mic Types” and download the soundfiles (which 
inelude examples of each mic operating type). 

2. Listen to the tracks. If you have access to an editor or 
digital audio workstation (DAW), import the files and 
look at the waveform amplitudes for each example. If 
you’d like to DIY, then ... 


Pulí out several mies from each operating type and 
plug them in (if you don’t have several types, maybe a 
studio, your school or a friend has a few you can take 
out for a spin). Try each one on an ¡nstrument and/or 
vocal. Are the differences between operating types 
more noticeable than between models in the same 
family? 


Tutorial: Mic Types 

3. 
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MICROPHONE 

CHARACTERISTICS 

To handle the wide range of applications that are 
encountered in studio, project and on-location 
recording, microphones will often differ in their 
overall sonic, electrical and physical characteris- 
tics. The following section highlights many of on .axi s 
these characteristics in order to help you choose 
the best mic for a given application. 


Directional response 

The directional response of a mic refers to its sen- 
sitivity (output level) at various angles of inci- 
dence with respect to the front (on-axis) of the 
microphone (Figure 4.12). This angular response 
can be graphically charted in a way that shows a micro- 
phone's sensitivity with respect to direction and frequency 
over 360°. Such a chart is commonly referred to as the 
mic's polar pattern. Microphone directionality can be clas- 
sified into two categories: 

■ Omnidirectional polar response 

■ Directional polar response. 

The omnidirectional mic (Figure 4.13) is a pressure-operated 
device that's responsive to sounds that emanate from all 
directions. In other words, the diaphragm will react equally 
to all sound-pressure fluctuations at its surface, regardless 
of the source's location. Pickups that display directional 
properties are pressure-gradient devices, meaning that the 
pickup is responsive to relative differences in pressure 
between the front, back and sides of a diaphragm. For 
example, a purely pressure-gradient mic will exhibit a bidi- 
rectional polar pattern (commonly called &figure-8 pattern), 
as shown in Figure 4.14. Many of the older ribbon mies exhibit a bidirectional 
pattern. Since the ribbon's diaphragm is often exposed to sound waves from 
both the front and rear axes, it's equally sensitive to sounds that emanate from 
either direction. Sounds from the rear will produce a signal that's 180° out of 
phase with an equivalent on-axis signal (Figure 4.15a). Sound waves arriving 
90° off-axis produce equal but opposite pressures at both the front and rear of 
the ribbon (Figure 4.15b), resulting in a cancellation at the diaphragm and no 
output signal. 

Figure 4.16 graphically illustrates how the acoustical combination (as well as 
electrical and mathematical combination, for that matter) of a bidirectional 


180 “ 


FIGURE 4.12 

Directional axis of a 
microphone. 



FIGURE 4.13 

Graphic 
representation 
of a typical 
omnidirectional 
pickup pattern. 
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FIGURE 4.14 

Graphic representa¬ 
ron of a typical 
bidirectional pickup 
pattern. 



► 


FIGURE 4.15 

Sound sources 
on-axis and 90° 
off-axis at the 
ribbon’s diaphragm. 

(a) The ribbon is 
sensitive to sounds 
at the front and rear. 

(b) Sound waves 
from 90° and 270° 
off-axis are canceled. 


rear sound 



(a) 



front sound 
source 



(pressure-gradient) and omnidirectional (pressure) pickup can be combined 
to obtain other directional pattern types. Actually, an infinite number of 
directional patterns can be obtained from this mixture, with the most widely 
known patterns being the cardioid, supercardioid and hypercardioid polar patterns 
(Figure 4.17). 
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Often, dynamic mies achieve a cardioid response (named after its heart-shaped 
polar chart, as shown in Figure 4.18) by incorporating a rear port into their 
design. This port serves as an acoustic labyrinth that creates an acoustic resis- 
tance (delay). In Figure 4.19a, a dynamic pickup having a cardioid polar 
response is shown receiving an on-axis (0 o ) sound signal. In effect, the dia- 
phragm receives two signáis: the incident signal, which arrives from the front, 
and an acoustically delayed rear signal. In this instance, the on-axis signal exerts 
a positive pressure on the diaphragm and begins its travels 90° to a port located 
on the side of the pickup. At this point, the signal is delayed by another 90° 
(using an internal, acoustically resistive material or labyrinth). In the time it 
takes for the delayed signal to reach the rear of the diaphragm (180°), the 
on-axis signal moves on to the negative portion of its acoustic eyele and then 
begins to exert a negative pressure on the diaphragm (pulling it outward). Since 
the delayed rear signal is 180° out of phase at this point in time, it will also 
begin to push the diaphragm outward, resulting in an output signal. 

Conversely, when a sound arrives at the rear of the mic, it begins its trek around 
to the mic's front. As the sound travels 90° to the side of the pickup, it is again 
delayed by another 90° before reaching the rear of the diaphragm. During this 
delay period, the sound continúes its journey around to the front of the mic—a 
delay shift that's also equal to 90°. Since the acoustic pressures at the dia- 
phragm's front and rear sides are equal and opposite, the sound is being simul- 
taneously pushed inward and outward with equal forcé, resulting in little or no 
movement ... and therefore will have little or no output signal (Figure 4.19b). 
The attenuation of such an off-axis signal, with respect to an equal on-axis 
signal, is known as its front-to-back discrimination and is rated in decibels. 

Certain condenser mies can be electrically switched from one pattern to another 
by using a second capsule that's mounted on both sides of a central backplate. 
Configuring these dual-capsule systems electrically in phase will create an 


FIGURE 4.16 

Directional combina- 
tions of various 
bidirectional and 
nondirectional pickup 
patterns. 
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FIGURE 4.17 

Various polar 
patterns with output 
sensitivity plotted 
versus angle of 
incidence. 
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omnidirectional pattern, while configuring them out of phase results in a 
bidirectional pattern. A number of intermedíate patterns (such as cardioid 
and hypercardioid) can be created by electrically varying between these two 
polar States (in either continuous or stepped degrees), as was seen earlier in 
Figure 4.16. 
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Frequency response 

The on-axis frequency-response curve of a micro- 
phone is the measurement of its output over the 
audible frequency range when driven by a con- 
stant, on-axis input signal. This response curve 
(which is generally plotted in output level [dB] 
over the 20- to 20,000-Hz frequency range) will 
often yield valuable information and can give 
clues as to how a microphone will react at spe- 
cific frequencies. It should be noted that a 
number of other variables also determine how a 
mic will sound, some of which have no measure¬ 
ment standards—the final determination should 
always be your own ears. 

A mic that's designed to respond equally to all 
frequencies is said to exhibit a fíat frequency 
response (shown as the top curve in Figure 
4.20a). Others can be made to emphasize or de- 
emphasize the high-, mid- or low-end response 
of the audio spectrum (shown as the boost in 
the high-end curve in Figure 4.20b) so as to give 
it a particular sonic character. The solid fre¬ 
quency-response curves (as shown in both parts 
a and b) were measured on-axis and exhibit an 
acceptable response. However, the same mies 
might exhibit a "peaky" or erratic curve when 
measured off-axis. These signal colorations 
could affect their sound when operating in an 
area where off-axis sound (in the form of 
leakage) arrives at the pickup (shown as the 
dotted curves in both parts a and b), and will 
often result in a tone quality change, when the 
leaked signal is mixed in with other properly 
miked signáis. 

At low frequencies, rumble (high-level vibrations 
that occur in the 3- to 25-Hz región) can be 
easily introduced into the surface of a large 
unsupported floor space, studio or hall from 
any number of sources (such as passing trucks, 
air conditioners, subways or fans). They can be 
reduced or eliminated in a number of ways, 
such as: 

■ Using a shock mount to isolate the mic 
from the vibrating surface and floor stand. 



on-axis 

signal 



FIGURE 4.18 

Graphic representa¬ 
ron of a typical 
cardioid pickup 
pattern. 





rear phase 
delay port 
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FIGURE 4.19 

The directional 
properties of a 
cardioid microphone. 
(a) Signáis arriving at 
the front (on-axis) of 
the diaphragm will 
produce a full output 
level. (b) Signáis 
arriving at the rear of 
the diaphragm (180°) 
will cancel each 
other out, resulting in 
a greatly reduced 
output. 
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FIGURE 4.20 

Frequency response 
curves: (a) response 
curve of the AKG 
C460B/CK61 ULS; 

(b) response curve 
of the AKG D321. 
(Courtesy of AKG 
Acoustics, Inc., www. 
akg-acoustics.com.) 
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■ Choosing a mic that displays a restricted low-frequency response 

■ Restricting the response of a wide-range mic by using a low-frequency 
roll-off filter. 

Another low-frequency phenomenon that occurs in most directional mies is 
known as proximity effect. This effect causes an increase in bass response when- 
ever a directional mic is brought within 1 foot of the sound source. This bass 
boost (which is often most noticeable on vocals) proportionately increases as 
the distance decreases. To compénsate for this effect (which is somewhat greater 
for bidirectional mies than for cardioids), a low-frequency roll-off filter switch 
(which is often located on the microphone body) can be used. If none exists, 
an external roll-off or equalizer can be used to reduce the low end. Any of these 
tools can be used to help restore the bass response to a fíat and natural-sound- 
ing balance. Another way to reduce or eliminate proximity effect and its associ- 
ated "popping" of the letters "p" and "b" is to replace the directional microphone 
with an omnidirectional mic when working at cióse distances. On a more 
positive note, this increase in bass response has long been appreciated by vocal- 
ists and DJs for their ability to give a full, "larger-than-life" quality to voices that 
are otherwise thin. In many cases, the use of a directional mic has become an 
important part of the engineer, producer and vocalist's toolbox. 
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C DIY J 

do it yourself 

Tutorial: Proximity Effect 

1. Pulí out omnidirectional, cardioid and bidirectional 3. Does the bass response increase as the distance is 

mies (or one that can be switched between these decreased with the cardioid? ... the bidirectional? ... 

patterns). the omni? 

2. Move in on each mic pattern type from distances of 3 
feet to 6 inches (being careful of volume levels and 
problems that can occur from popping). 


Transient response 

A significant piece of data (which currently has no accepted standard of measure) 
is the transient response of a microphone (Figure 4.21). Transient response is the 
measure of how quickly a mic's diaphragm will react when it is hit by an acous- 
tic wavefront. This figure varies wildly among microphones and is a major 
reason for the difference in sound quality among the three pickup types. For 
example, the diaphragm of a dynamic mic can be quite large (up to 2.5 inches). 
With the additional weight of the coil of wire and its core, this combination 
can be a very large mass when compared to the power of the sound wave that 
drives it. Because of this, a dynamic mic can be very slow in reacting to a 
waveform—often giving it a rugged, gutsy, and less accurate sound. 

By comparison, the diaphragm of a ribbon mic is much lighter, so its diaphragm 
can react more quickly to a sound waveform, resulting in a clearer sound. The 
condenser pickup has an extremely light diaphragm, which varies in diameter 
from 2.5 inches to less than % inch and has a thickness of about 0.0015 inch. 
This means that the diaphragm offers very little mechanical resistance to a 
sound-pressure wave, allowing it to accurately track the wave over the entire 
frequeney range. 

Output characteristics 

A microphones output characteristics refer to its measured sensitivity, equivalent 
noise, overload characteristics, impedance and other output responses. 

SENSITIVITY RATING 

A mic's sensitivity rating is the output level (in volts) that a microphone will 
produce, given a specific and standardized acoustic signal at its input (rated 
in dB SPL). This figure will specify the amount of amplification that's required 
to raise the mic's signal to line level (often referenced to -10 dBv or +4 dBm) 
and allows us to judge the relative output levels between any two mies. A 
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FIGURE 4.21 

Transient response 
characteristics of a 
percussive wood- 
block using various 
microphone types: 

(a) Shure SM58 
dynamic; (b) RCA 
44BX ribbon; (c) AKG 
C3000 condenser. 



(c) 


microphone with a higher sensitivity rating will produce a stronger output 
signal voltage than one with a lower sensitivity. 

EQUIVALENT NOISE RATING 

The equivalent noise rating of a microphone can be viewed as the device's electri- 
cal self-noise. It is expressed in dB SPL or dBA (a weighted curve) as a signal 
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that would be equivalent to the mic's self-noise voltage. As a general rule, the 
mic itself doesn't contribute much noise to a system when compared to the 
mixer's amplification stages, the recording system or media (whether analog or 
digital). However, with recent advances in mic preamp/mixer technologies and 
overall reductions in noise levels produced by digital systems, these noise ratings 
have become increasingly important. Interestingly enough, the internal noise 
of a dynamic or ribbon pickup is actually generated by the electrons that move 
within the coil or ribbon itself. Most of the noise that's produced by a condenser 
mic is generated by the built-in preamp. It almost goes without saying that 
certain microphone designs will have a higher degree of self-noise than will 
others; thus, care should be taken in your microphone choices for critical appli- 
cations (such as with distant classical recording techniques). 

OVERLOAD CHARACTERISTICS 

Just as a microphone is limited at low levels by its inherent self-noise, it's also 
limited at high sound-pressure levels (SPLs) by overload distortion. In terms of 
distortion, the dynamic microphone is an extremely rugged pickup, often 
cap able of an overall dynamic range of 140 dB. Typically, a condenser micro¬ 
phone won't distort, except under the most severe sound-pressure levels; 
however, the condenser system differs from the dynamic in that at high acous- 
tic levels the capsules output might be high enough to overload the mic's 
preamplifier. To prevent this, most condenser mies offer a switchable attenua- 
tion pad that immediately follows the capsule output and serves to reduce the 
signal level at the preamp's input, thereby reducing or eliminating overload 
distortion. When inserting such an attenuation pad into a Circuit, keep in mind 
that the mic's signal-to-noise ratio will be degraded by the amount of attenua¬ 
tion; therefore, it's always wise to remove the inserted pad when using the 
microphone under normal conditions. 

MICROPHONE IMPEDANCE 

Microphones are designed to exhibit different output impedances. Output imped- 
ance is a rating that's used to help you match the output resistance of one device 
to the rated input resistance requirements of another device (so as to provide 
the best-possible level and frequeney response matching). 

Impedance is measured in ohms (with its Symbol being Q or Z). The most 
commonly used microphone output impedances are 50, 150 and 250 Q (low) 
and 20 to 50 kQ (high). Each impedance range has its advantages. In the past, 
high-impedance mies were used because the input impedances of most tube- 
type amplifiers were high. A major disadvantage to using high-impedance mies 
is the likelihood that their cables will pick up electrostatic noise (like those 
caused by motors and fluorescent lights). To reduce such interference, a shielded 
cable is necessary, although this begins to act as a capacitor at lengths greater 
than 20 to 25 feet, which serves to short out much of the high-frequeney infor- 
mation that's picked up by the mic. For these reasons, high-impedance micro¬ 
phones are rarely used in the professional recording process. 
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Most modern-day Systems, on the other hand, are commonly designed to accept 
a low-impedance microphone source. The lines of very-low-impedance mies 
(50 Q) have the advantage of being fairly insensitive to electrostatic pickup. 
They are, however, sensitive to induced hum pickup from electromagnetic fields 
(such as those generated by AC power lines). This extraneous noise can be 
greatly reduced through the use of a twisted-pair cable, because the interference 
that's magnetically induced into the cable will flow in opposite directions along 
the cables length and will cancel out at the consolé or mixer's balanced micro¬ 
phone input stage. Mic lines of 150 to 250 Q are less susceptible to signal losses 
and can be used with cable lengths of up to several thousand feet. They're also 
less susceptible to electromagnetic pickup than the 50-Q lines but are more 
susceptible to electrostatic pickup. As a result, most professional mies opérate 
with an impedance of 200 Q, use a shielded twisted-pair cable and have reduced 
noise through the use of a balanced signal line. 

BALANCED/UNBALANCED LINES 

In short, a balanced line uses three wires to properly carry the audio signal. Two 
of the wires are used to carry the signal voltage, while a third lead is used as a 
neutral ground wire. Since neither of the two signal conductors of a balanced 
line is directly connected to the signal ground, the alternating current of an 
audio signal will travel along the two independent wires. From a noise stand- 
point, whenever an electrostatic or electromagnetic signal is induced across the 
audio leads, it will be induced into both of the audio leads at an equal level 
(Figure 4.22). Since the input of a balance device will only respond to the 
alternating voltage potentials between the two leads, the unwanted noise (which 
is equal and opposite in polarity) will be canceled. 


The standard that has been widely adopted for the hot) and pin 3 as being negative (- or neutral), with the 

proper polarity of two-conductor, balanced, XLR cable ground being connected to pin 1. 

connector cables specifies pin 2 as being positive (+ or 


If the hot and neutral pins of balanced mic cables are haphazardly pinned in 
a music or production studio, it's possible that any number of mies (and 
other equipment, for that matter) could be wired in opposite, out-of-phase 
polarities. For example, if a single instrument were picked up by two mies using 
two improperly phased cables, the instrument might totally or partially cancel 
when mixed to mono. For this reason, it's always wise to use a phase tester or 
volt-ohm meter to check the cable wiring throughout a pro or project studio 
complex. 
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FIGURE 4.22 

Wiring detaiI of a 
balanced microphone 
cable (courtesy of 
Loud Technologies 
Inc., www.mackie. 
com): (a) diagram for 
wiring a balanced 
microphone (or line 
source) to a balanced 
XLR connector; 

(b) physical drawings; 

(c) diagram for wiring 
a balanced 1/4-inch 
phone connector; 

(d) equivalent Circuit, 
where the induced 
signáis travel down 
the wires in equal 
polarities that cancel 
at the transformer, 
whereby the AC 
audio signáis are of 
opposing polarities 
that generate an 
output signal. 


































































Microphone Preamps 


unbalanced line 


i 

i 


\ 

i 

4 




g round 
(common) 

cold 

() 

hot 

( + ) 




ground 

(-) 


hot 

t+) 





hot 


j ground 

(d) 

FIGURE 4.23 

Unbalanced microphone Circuit (courtesy of Loud Technologies Inc., www.mackie.com): (a) diagram for wiring an 
unbalanced microphone (or line source) to a balanced XLR connector; (b) diagram for wiring an unbalanced 1/4-inch 
phone connector; (c) physical drawings; (d) equivalent Circuit. 


High-impedance mies and most line-level instrument lines use unbalanced 
Unes (Figure 4.23) to transmit signáis from one device to another. In an unbal¬ 
anced circuit, a single signal lead carries a positive current potential to a device, 
while a second, grounded shield (which is tied to the chassis ground) is used 
to complete the circuit's return path. When working at low signal levels (espe- 
cially at mic levels), any noises, hums, buzzes or other types of interference 
that are induced into the signal path will be amplified along with the input 
signal. 

MICROPHONE PREAMPS 

Since the output signáis of most microphones are at levels far too low to drive 
the line-level input stage of most recording systems, a mic preamplifier must be 
used to boost its signal to acceptable levels (often by 30 to 70 dB). With the 
advent of improved technologies in analog and digital consolé design, hard-disk 
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recorders, DAWs, signal processors and the like, low noise and distortion figures 
have become more important than ever. To many professionals, the stock mic 
pres (pronounced "preeze") that are designed into many consolé types don't 
have that special "sound," aren't high enough in quality to be used in critical 
applications or don't have enough of a special, boutique cache for that special 
application. As a result, outboard mic preamps are chosen instead (Figures 4.24 
through 4.27) for their low-noise, low-distortion specs and/or their unique 
sound. These devices might make use of tube, FET and/or integrated Circuit 
technology, and offer advanced features in addition to the basic variable input 
gain, phantom power and high-pass filter Controls. As with most recording 
tools, the sound, color scheme, retro style, tube or transistor type and budget 
level are up to the individual, the producer and the artist ... it's totally a matter 
of personal style and taste. Note that mic pres have tapped into the growing 



FIGURE 4.24 

PreSonus TubePre. 
(Courtesy of 
PreSonus Audio 
Electronics, www. 
presonus.com.) 



FIGURE 4.25 

Grace MI 01 mic 
preamplifier. 
(Courtesy of Grace 
Design, www. 
gracedesign.com.) 



FIGURE 4.26 

Millennia HV-3D mic 
preamp. (Courtesy of 
Millennia Music & 
Media Systems, 
www.mil-media. 
com.) 



FIGURE 4.27 

Xlogic SuperAnalogue 
mic amp. (Courtesy 
of Solid State Logic, 
www.solid-state- 
logic.com.) 
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market of those Systems that are based around a DAW, which doesn't need a 
consolé or mixer but does require a quality pre (or set of pres) for plugging mic 
signáis directly into the interface. 

MICROPHONE TECHNIQUES 

Most microphones have a distinctive sound character that's based on its specific 
type and design. A large number of types and models can be used for a variety 
of applications, and it's up to the engineer to choose the right one for the 
job. Over the years, I've come to the realization that there are two particular 
paths that one can take when choosing the types and models of microphones 
for a studio's production toolbox. These can basically be placed into the 
categories of: 

■ Selecting a limited range of mies that are well suited for a wide range of 
applications 

■ Acquiring a larger collection of mies that are commonly perceived as being 
individually suited for a particular instrument or situation. 

The first approach is ideal for the project studio and those who are just starting 
out and are on a limited budget. It is also common practice among seasoned 
professionals who swear by a limited collection of their favorite mies that are 
chosen to cover a wide range of applications. These dynamic and/or condenser 
mies can be used both in the project studio and in the professional studio to 
achieve the best possible sound on a budget. 

The second approach (I often refer to it as the "Alan Sides" approach) is better 
suited to the professional studio (and to personal collectors) who actually have 
a need or desire to amass their own "dream collection" and offer it to their 
clients. In the end, both approaches have their merits. ... Indeed, it's usually 
wise to keep an open mind and choose a range of mic types that best fit your 
needs, budget and personal style. 

Choosing the appropriate mic, however, is only half the story. The placement 
of a microphone will often play just as important a role, and is one of the 
engineer's most valued tools. Because mic placement is an art form, there is no 
right or wrong. Placement techniques that are currently considered "bad" might 
easily be the accepted as being standard practice five years from now ... and as 
new musical styles develop, new recording techniques will also tend to evolve, 
helping to breathe new life into music and production. The craft of recording 
should always be open to change and experimentation—two of the strongest 
factors that keep the music and the biz of music alive and fresh. 

Pickup characteristics as a function of working distance 

In studio and sound-stage recording, four fundamental styles of microphone 
placement are directly related to the working distance of a microphone from its 
sound source. These extremely important placement styles are as important as 
any tool in the toy box: 
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FIGURE 4.28 

Example of an overall 
distant pickup. 


■ Distant miking 

■ Cióse miking 

■ Accent miking 

■ Ambient miking. 

DISTANT MICROPHONE PLACEMENT 

With distant microphone placement (Figure 4.28), one or more mies can be 
positioned at a distance of 3 feet or considerably more from the intended signal 
source. This technique (whose distance will vary with room and instrument 
size) will often yield the following results: 

■ It can pick up a large portion of a musical instrument or ensemble, thereby 
preserving the overall tonal balance of that source. Often, a natural tone 
balance can be achieved by placing the mic at a distance that's roughly 
equal to the size of the instrument or sound source. 

■ It allows the room's acoustic environment to be picked up (and naturally 
mixed in) with the direct sound signal. 

Distant miking is often used to pick up large instrumental ensembles (such as a 
symphony orchestra or choral ensemble). In this application, the pickup will 
largely rely on the acoustic environment to help achieve a natural, ambient 
sound. The mic should be placed at a distance so as to strike an overall balance 
between the ensemble's direct sound and the room's acoustics, giving a balance 
that's determined by a number of factors, including the size of the sound source, 
its overall volume level, mic distance and placement and the reverberant char- 
acteristics of the room. 
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FIGURE 4.29 

Resulting frequency 
response from a 
microphone that 
receives a direct and 
delayed sound from 
a single source. 


This technique tends to add a live, open feeling to a recorded sound; however, 
it could put you at a disadvantage if the acoustics of a hall, church or studio 
aren't particularly good. Improper or bad room reflections can create a muddy 
or poorly defined recording. To avoid this, the engineer might take one of the 
following actions: 

■ Temporarily correct for bad or excessive room reflections by using 
absorptive and/or offset reflective panels (to break up the problematic 
reflections). 

■ Place the mic closer to its source and add a degree of artificial ambience. 

If a distant mic is used to pick up a portion of the room sound, placing it at a 
random height can result in a hollow sound due to phase cancellations that 
occur between the direct sound and delayed sounds that are reflected off the 
floor and other nearby surfaces (Figure 4.29). If these delayed reflections arrive 
at the mic at a time that's equal to one-half a wavelength (or at odd múltiples 
thereof), the reflected signal will be 180° out of phase with the direct sound. 
This could produce dips in the signal's pickup response that could adversely 
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color the signal. Since the reflected sound is at a lower level than the direct 
sound (as a result of traveling farther and losing energy as it bounces off a 
surface), the cancellation will only be partially complete. Raising the mic will 
have the effect of reducing reflections (due to the increased distances that 
the reflected sound must travel), while moving the mic cióse to the floor will 
conversely reduce the path length and raise the range in which the frequency 
cancellation occurs. In practice, a height of 1/8 to 1/16 inch will raise the 
cancellation above 10 kHz. One such microphone design type, known as a 
boundary microphone (Figures 4.30 and 4.31), places an electret-condenser or 
condenser diaphragm well within these low height restrictions. For this reason, 
this mic type might be a good choice for use as an overall distant pickup, when 
the mies need to be out of sight (i.e., when placed on a floor, wall or large 

boundary). FIGURE 4.30 

The boundary 

CLOSE MICROPHONE PLACEMENT microphone system. 

When a cióse microphone placement is used, the 
mic is often positioned about 1 inch to 3 feet 
from a sound source. This commonly used tech- 
nique generally yields two results: 

■ It creates a tight, present sound quality. 

■ It effectively exeludes the acoustic 
environment. 

Because sound diminishes with the square of its 
distance from the sound source, a sound that 
originates 3 inches from the pickup will be much 
higher in level than one that originates 6 feet 
from the mic (Figure 4.32). Therefore, whenever 
cióse miking is used, only the desired on-axis 
sound will be recorded—extraneous, distant 
sounds (for all practical purposes) won't be 
picked up. In effect, the distant pickup will 




FIGURE 4.31 

The PZM-6D 
boundary micro¬ 
phone. (Courtesy 
of Crown Interna¬ 
tional, Inc., www. 
crownaudio.com.) 
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FIGURE 4.32 

Cióse miking reduces 
the effects of 
the acoustic 
environment. 


FIGURE 4.33 

Leakage due to 
indirect signal 
pickup. 



be masked by the closer sounds and/or will be reduced to a relative level that's 
well below the main pickup. 

Whenever an instrument's mic also picks up the sound of a nearby instrument, 
a condition known as leakage occurs (Figure 4.33). Whenever a signal is picked 
up by both its intended mic and a nearby mic (or mies), it's easy to see 
how the signáis could be combined together within the mixdown process. When 
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this occurs, level and phase cancellations often make it more difficult to have 
control over the volume and tonal character of the involved instruments within 
a mix. 

To avoid the problems that can be associated with leakage, try the following: 

■ Place the mies closer to their respective instruments (Figure 4.34a). 

■ Use directional mies. 

■ Place an acoustic barrier (known as a fíat, gobo, or divider) between the 
instruments (Figure 4.34b). Alternatively, mic/instruments can be sur- 
rounded on several sides by sound baffles and (if needed) a top can be 
draped over them. 

■ Spread the instruments farther apart. 

■ An especially loud (or quieter) instrument can be isolated by putting it 
in an unused iso-room or vocal or instrument booth. Electronic amps 
that are played at high volumes can also be recorded in such a room. An 
amp and the mic can be covered with a blanket or other flexible sound- 
absorbing material, so that there's a clear path between the amplifier and 
the mic. 

■ Separation can be achieved by plugging otherwise loud electronic instru¬ 
ments directly into the consolé via a direction injection (DI) box, thereby 
bypassing the miked amp. 

Obviously, these examples can only suggest the number of possibilities that can 
occur during a session. For example, you might choose not to isolate the instru¬ 
ments and instead, place them in an acoustically "Uve" room. This approach 
will require that you carefully place the mies in order to control leakage; 
however, the result will often yield a live and present sound. As an engineer, 
producer and/or artist, the choices belong to you. Remember, the idea is to 
work out the kinks beforehand and to simplify technology as much as possible 
in the studio because Murphy's law is always alive and well in any production 
fácil ity. 

Whenever individual instruments are being miked cióse (or semi-close), it's 
generally wise to follow the 3 :1 distance rule. 


3:1 Distance Rule 

To reduce leakage and maintain phase ¡ntegrity, this rule its source, a nearby mic (or mies) should be separated by 

States that for every unit of distance between a mic and at least three times that distance (Figure 4.35). 


Some err on the side of caution and avoid leakage even further by following a 
5:1 distance rule. As always, experience will be your best teacher. Although the 
cióse miking of a sound source offers several advantages, a mic should be placed 
only as cióse to the source as is necessary, not as cióse as possible. Miking too 
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FIGURE 4.34 a-b 

Two methods for 
reducing leakage: 

(a) Place the 
microphones closer 
to their sources. 

(b) Use an acoustic 
barrier to reduce 
leakage. 
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FIGURE 4.35 

Example of the 3:1 
microphone distance 
rule: “For every unit 
of distance between 
a mic and its source, 
a nearby mic (or 
mies) should be 
separated by at least 
three times that 
distance.” 


cióse can color the recorded tone quality of a source, unless care is taken and 
careful experimentation is done. 

It should be noted, however, that a bit of "bleed" (a slang word for leakage) 
between mies just might be a good thing. With semi-distant and even múltiple 
mies that are closely spaced, the pickup of a source by several pickups can add 
a sense of increased depth and sonic space. Having an overall distant set of 
mies in the studio can add a dose of natural ambience that can actually help to 
"glue" a mix together. The concept of minute phase cancellations and leakage 
in a mix isn't always something to be feared; it's simply important that you be 
aware of the effeets that it can have on a mix ... and use that knowledge to your 
advantage. 

Because cióse mic techniques commonly involve distances of 1 to 6 inches, the 
tonal balance (timbre) of an entire sound source often can't be picked up; 
rather, the mic might be so cióse to the source that only a small portion of the 
surface is actually picked up, giving it a tonal balance that's very area specific 
(much like hearing the focused parts of an instrument through an acoustic 
microscope). At these cióse distances, moving a mic by only a few inches can 
easily change the pickup tonal balance. If this occurs, try using one or more of 
the following remedies: 

■ Move the microphone along the surface of the sound source until the 
desired balance is achieved. 

■ Place the mic farther back from the sound source to allow for a wider 
angle (thereby picking up more of the instrument's overall sound). 

■ Change the mic. 

■ Equalize the signal until the desired balance is achieved. 













3 Microphone echniques 


c 0 I Y J 

do it yourself 

Tutor ial: Cióse Mic Experimentation 

1. Mic an acoustic ¡nstrument (such as a guitar or piano) to variations in the sound. Does the sound 

at a distance of 1 to 3 inches. change? What are your favorite and least favorite 

2. Move (or have someone move) the mic over the positions? 

instrument’s body as it’s being played, while listening 


In addition to all of the above considerations, the placement of musicians and 
instruments will often vary from one studio and/or session to the next because 
of the room, people involved, number of instruments, isolation (or lack thereof) 
among instruments, and the degree of visual contact that's needed for Creative 
communication. If additional isolation (beyond careful microphone place¬ 
ment) is needed, fíats and baffles can be placed between instruments in order 
to prevent loud sound sources from spilling over into other open mikes. Alter- 
natively, the instrument or instruments could be placed into sepárate isolation 
(iso) rooms and/or booths, or they could be overdubbed at a later time. 

During a session that involves several musicians, the setup should allow them 
to see and interact with each other as much as possible. It's extremely important 
that they be able to give and receive visual cues and otherwise "feel the vibe." 
The instrument/mic placement, baffle arrangement, and possibly room acous- 
tics (which can often be modified by placing absorbers in the room) will depend 
on the engineer's and artists' personal preferences, as well as on the type of 
sound the producer wants. 

ACCENT MICROPHONE PLACEMENT 

Often, the tonal and ambient qualities will sound very different between a 
distant- and close-miked pickup. Under certain circumstances, it's difficult to 
obtain a naturally recorded balance when mixing the two together. For example, 
if a solo instrument within an orchestra needs an extra mic for added volume 
and presence, placing the mic too cióse would result in a pickup that sounds 
overly present, unnatural and out of context with the distant, overall orchestral 
pickup. To avoid this pitfall, a compromise in distance should be struck. A 
microphone that has been placed within a reasonably cióse range to an instru¬ 
ment or section within a larger ensemble (but not so cióse as to have an 
unnatural sound) is known as an accent pickup (Figure 4.36). Whenever accent 
miking is used, care should be exercised in placement and pickup choices. The 
amount of accent signal that's introduced into the mix should sound natural 
relative to the overall pickup, and a good accent mic should only add presence 
to a solo passage and not stick out as sepárate, identifiable pickup. 
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FIGURE 4.36 

Accent microphone 
placed at proper 
compromise 
distance. 


AMBIENT MICROPHONE PLACEMENT 

Ambient miking places the pickup at such a distance that the reverberant or room 
sound is equally or more prominent than the direct signal. The ambient pickup 
is often a cardioid stereo pair or crossed figure-8 (Blumlein) pair that can be 
mixed into a stereo or surround-sound production to provide a natural reverb 
and/or ambience. To enhance the recording, you can use ambient mic pickups 
in the following ways: 

■ In a live concert recording, ambient mies can be placed in a hall to restore 
the natural reverberation that is often lost with cióse miking techniques. 

■ In a live concert recording, ambient microphones can be placed over the 
audience to pick up their reaction and applause. 

■ In a studio recording, ambient microphones can be used in the studio to 
add a sense of space or natural acoustics back into the sound. 


IY 1 

it yourself 

Tutorial: Ambient Miking 

1. Mic an instrument or ¡ts amp (such as an acoustic or 3. Mix the two pickup types together. Does it “open” the 

electric guitar) at a distance of 6 inches to 1 foot. sound up and give it more space? Does it muddy the 

2. Place a stereo mic pair (in an X/Y and/or spaced sound up or breathe new lite into it? 

configuration) in the room, away from the instrument. 
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Stereo miking techniques 

For the purpose of this discussion, the term stereo 
miking technique refers to the use of two micro- 
phones in order to obtain a coherent stereo 
image. These techniques can be used in either 
cióse or distant miking of single instruments, 
vocals, large or small ensembles, within on-loca- 
tion or studio applications ... in fact, the only 
limitation is your imagination. The four funda¬ 
mental stereo miking techniques are: 

■ Spaced pair 

■ X/Y 

■ M/S 

■ Decca tree. 

SPACED PAIR 


FIGURE 4.37 

Spaced stereo miking 
technique. 


Spaced microphones (Figure 4.37) can be placed in front of an instrument or 
ensemble (in a left/right fashion) to obtain an overall stereo image. This tech¬ 
nique places the two mies (of the same type, manufacturer and model) any- 
where from only a few feet to more than 30 feet apart (depending on the size 
of the instrument or ensemble) and uses time and amplitude cues in order to 
create a stereo image. The primary drawback to this technique is the strong 
potential for phase discrepancies between the two channels due to differences 
in a sound's arrival time at one mic relative to the other. When mixed to mono, 
these phase discrepancies could result in variations in frequeney response and 
even the pardal cancellation of instruments and/or sound components in the 
pickup field. 


X/Y 

X/Y stereo miking is an intensity-dependent system that uses only the cue of 
amplitude to discriminate direction. With the X/Y coincident-pair technique 
(Figure 4.38), two directional microphones of the same type, manufacture and 
model are placed with their grills as cióse together as possible (without touch- 
ing) and facing at angles to each other (generally between 90° and 135°). The 
midpoint between the two mies is pointed toward the source, and the mic 
outputs are equally panned left and right. Even though the two mies are placed 
together, the stereo imaging is excellent—often better than that of a spaced pair. 
In addition, due to their proximity, no appreciable phase problems arise. Most 
commonly, X/Y pickups use mies that have a cardioid polar pattern, although 
the Blumlein technique is being increasingly used. This technique (which is 
named after the unheralded inventor, Alan Dower Blumlein) uses two crossed 
bidirectional mies that are offset by 90° to each other. This simple technique 
often yields excellent ambient results for the pickup of the overall ambience 
within a studio or concert hall, while also being a good choice for picking up 
sources that are placed "in the round." 
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FIGURE 4.38 

X/Y stereo miking 
technique using an 
X/Y crossed cardioid 
pair. 


Stereo microphones that contain two diaphragms 
in the same case housing are also available on 
the new and used market. These mies are either 
fixed (generally in a 90° or switchable X/Y 
pattern) or are designed so that the top dia- 
phragm can be rotated by 180 ° (allowing for the 
adjustment of various coincident X/Y angles). 

M/S 


mid (cardioid) 


side + (fig 8) 



Another coincident-pair system, known as the 
M/S (or mid-side) technique (Figure 4.39), is 
similar to X/Y in that it uses two closely spaced, 
matched pickups. The M/S method differs from 
the X/Y method, however, in that it requires the 
use of an external transformer, active matrix, or 
software plug-in in order to work. In the elassie 

M/S stereo miking configuration, one of the microphone capsules is designated 
the M (mid) position pickup and is generally a cardioid pickup pattern that 
faces forward, toward the sound source. The S (side) capsule is generally chosen 
as a figure-8 pattern that's oriented sideways (90° and 270°) to the on-axis 
pickup (i.e., with the nuil facing to the side, away from the cardioid's main axis). 
In this way, the mid capsule picks up the direct sound, while the side figure-8 
capsule picks up ambient and reverberant sound. These outputs are then 
combined through a sum-and-difference decoder matrix either electrically 
(through a transformer matrix) or mathematically (through a digital M/S plug- 
in), which then resolves them into a conventional X/Y stereo signal: (M + S = 
left) and (M - S = right). 


side - (fig 8) 


FIGURE 4.39 

M/S stereo micro¬ 
phone technique. 
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One advantage of this technique is its absolute monaural compatibility. 
When the left and right signáis are combined, the sum of the output will be 
(M + S) + (M - S) = 2M. That's to say, the side (ambient) signal will be canceled, 
but the mid (direct) signal will be accentuated. Since it is widely accepted that 
a mono signal loses its intelligibility with added reverb, this tends to work to 
our advantage. Another amazing side benefit of using M/S is the fact that it lets 
us continuously vary the mix of mid (direct) to side (ambient) sound that's 
being picked up either during the recording (from the consolé location) ... or 
even at a later time during mixdown, after it's been recorded! These are both 
possible by simply mixing the ratio of mid to side that's being sent to the 
decoder matrix (Figure 4.40). In a mixdown scenario, all that's needed is to 
record the mid on one track and the side on another. (It's often best to use a 
digital recorder, because phase delays associated with the analog recording 
process can interfere with decoding.) During mixdown, routing the M/S tracks 



FIGURE 4.40 

M/S decoder matrix: 

(a) AEA MS-38 Mark 
II dual-mode stereo 
width controller and 
Matrix MS processor 
(courtesy of Audio 
Engineering 
Associates, www. 
ribbonmics.com); 

(b) Waves SI Stereo 
Imager plug-in 
ineludes True 
Blumlein shuffling 
and MS/LR Process¬ 
ing (courtesy of 
Waves, www.waves. 
com). 




(b) 
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FIGURE 4.41 

Decca tree 
microphone array. 
(Courtesy of Audio 
Engineering 
Associates, www. 
ribbonmics.com.) 



to the decoder matrix allows you to make important decisions regarding stereo 
width and depth at a later, more controlled date. 

DECCA TREE 

Although not as commonly used as the preceding stereo techniques, the Decca 
tree is a time-tested, classical miking technique that uses both time and ampli- 
tude cues in order to create a coherent stereo image. Attributed originally to 
Decca engineers Roy Wallace and Arthur Haddy in 1954, the Decca tree (Figure 
4.41) consists of three omnidirectional mies (originally, Neumann M50 mies 
were used). In this arrangement, a left and right mic pair is placed 3 feet apart, 
and a third mic is placed 1.5 feet out in front and panned in the center of the 
stereo field. Still favored by many in orchestral situations as a main pickup pair, 
the Decca tree is most commonly placed on a tall boom, above and behind the 
conductor. According to lore, when Haddy first saw the array, he remarked, "It 
looks like a bloody Christmas tree!" The ñame stuck. 

Surround miking techniques 

With the advent of 5.1 surround-sound production, it's certainly possible to 
make use of a surround consolé or DAW to place sources that have been 
recorded in either mono or stereo into a surround image field. Under certain 
situations, it's also possible to consider using multiple-pickup surround miking 
techniques in order to capture the actual acoustic environment and then trans¬ 
íate that into a surround mix. Just as the number of techniques and personal 
styles increases when miking in stereo compared to mono, the number of place- 
ment and technique choices will likewise increase when miking a source in 
surround. Although guidelines have been and will continué to be set, both 
placement and mixing styles are definitely an art and not a Science. 

AMBIENT SURROUND MICS 

A relatively simple, yet effective way to capture the surround ambience of a live 
or studio session is to simply place a spaced or coincident mic pair out in the 
studio at a distance from the sound source. These can be facing toward or away 
from the sound source, and placement is totally up to experimentation. During 
a surround mixdown, placing distant mies into the studio or hall can work 
wonders to add a sense of space to an ensemble group, drum set or instrument 
overdub. 
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do it yourself 


1. Mic an instrument or ensemble group using traditional 
cióse pickup techniques. 

2. Place a spaced or Blumlein pickup pair ¡n the room at 
a considerable distance from the source. 

3. Record the ambient mies to a spare set of tracks and 
place them into a stereo mix. 


If you’re lucky enough to be surround-capable, place 
the ambient tracks to the rear. Does it add an extra 
dimensión of space? Does it alter the recording’s 
definition? 


Tutorial: Ambient Surround Mies 

4. 



FIGURE 4.42 

Five cardioid 
microphones can be 
arranged in a circular 
pattern (with the 
center microphone 
facing toward the 
source) to create a 
modified, mini- 
surround Decca tree. 
A four-microphone 
(quad) approach to 
surround miking can 
be easily made by 
simply eliminating 
the center pickup. 


One last approach (which doesn't actually fall under the Decca 
tree category) involves the use of four cardioid mies that are 
spaced at 90° angles, representing L-R-SL-SR, with the on-axis 
point being placed 45° between the L and R mies. This "quad" 
configuration can be easily made by mounting the mies on two 
stereo bars that are offset by 90°. Note that Samson's H2 handheld flash 
memory recorder uses four mies to affordably and simply record in this fashion. 


RECORDING DIRECT 


As an alternative, the signal of an electric or electronic instrument (guitar, key- 
board, etc.) can be directly "injected" into a consolé, recorder or DAW without 
the use of a microphone. This option often produces a cleaner, more present 
sound by bypassing the distorted components of a head/amp combination. It 
also reduces leakage into other mies by eliminating room sounds. In the project 
or recording studio, the direct injection (DI) box (Figure 4.43) serves to interface 
an instrument with an analog output signal to a consolé or recorder in the fol¬ 
io wing ways: 


SURROUND DECCA TREE 

One of the most logical techniques for capturing an ensemble or 
instrument in a surround setting places five mies onto a modified 
Decca tree. This ingenious and simple system adds two rear-facing 
mies to the existing three-mic Decca tree system. Another simpler 
approach is to place five cardioid mies in a circle, such that the 
center channel faces toward the source, thereby creating a simple 
setup that can be routed L-C-R-SL-SR (Figure 4.42). 


■ It reduces an instrumenté line-level output to mic level for direct insertion 
into the console's mic input jack. 

■ It changes an instrumenté unbalanced, high-source impedance line to a 
balanced, low-source impedance signal thaté needed by the consoleé 
input stage. 
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■ It often can electrically isolate the audio 
signal paths between the instrument and 
mic/line preamp stages (thereby reducing 
the potential for ground-loop hum and 
buzzes). 


Most commonly, the instrumenté output is 
plugged directly into the DI box (where it's 
stepped down in level and impedance), and the 
box's output is then fed into the mic pre of a 
consolé or DAW. If a "dirtier" sound is desired, 
certain boxes will allow high-level input signáis 
to be taken directly from the amp's speaker 
output jack. It's also not uncommon for an engi- 
neer, producer and/or artist to combine the 
punchy, full sound of a mic with the present 
crispness of a direct sound. These signáis can 
then be combined onto a single tape track or 
recorded to sepárate tracks (thereby giving more 
flexibility in the mixdown stage). The ambient 
image can be "opened up" even further by mixing 
a semi-distant or distant mic (or stereo pair) with the direct (and even with the 
cióse miked amp) signal. This ambient pickup can be either mixed into a stereo 
field or at the rear of a surround field to fill out the sound. 

When recording a guitar, the best tone and lowest hum pickup for a direct con- 
nection occurs when the instrument volume control is fully turned up. Because 
guitar tone Controls often use a variable treble roll-off, leaving the tone Controls 
at the treble setting and using a combination of consolé EQ and different guitar 
pickups to vary the tone will often yield the máximum amount of control over 
the sound. Note that if the treble is rolled off at the guitar, boosting the highs 
with EQ will often increase pickup noise. 


FIGURE 4.43 

Radial JDI passive 
direct box. (Courtesy 
of Radial Engineer- 
ing, www.radialeng. 
com.) 



REAMPING IT IN THE MIX 

Another way to alter the sound of a recorded track or to inject a new sense of 
acoustic space into an existing take is to reamp a track. The "reamp" process 
(originally conceived in 1993 by recording engineer John Cuniberti; www. 
reamp.com) lets us record a guitar's signal directly to a track using a DI during 
the recording session and then play this cleanly recorded track back through a 
miked guitar amp/speaker, allowing it to be re-recorded to new tracks at another 
time (Figure 4.44). 

The re-recording of an instrument that has been recorded directly gives us total 
flexibility for changing the final, recorded amp and mic sound at a later time. 
For example, it's well known that it's far easier to add an effect to a "dry" track 
that doesn't have effects during mixdown than to attempt to remove an effect 
after it's been printed to track. Whenever reamping is used at a later time, it's 
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FIGURE 4.44 

Example of how a 
direct recording can 
be “reamped” in a 
studio, allowing for 
complete tonal, mic 
placement, and 
acoustical control 
... after the fact! 
(Courtesy of John 
Cuniberti, www. 
reamp.com.) 



possible to audition any number of amps, using any number of effects and/or 
mic settings, until the desired sound has been found. This process allows the 
musician to concéntrate solely on getting the best recorded performance, 
without having to spend extra time getting the perfect guitar, amp, mic and 
room sound. Leakage problems in the studio are also reduced, because no mikes 
are used in the process. 

Although the concept of recording an instrument directly and playing the track 
back through a miked amp at a later time is relatively new, the idea of using a 
room's sound to fill out the sound of a track or mix isn't. The reamp concept 
takes this idea a bit further by letting you go as wild as you like. For example, 
you could use the process to re-record a single, close-miked guitar amp and then 
go back and layer a larger stack at a distance. An electronic guitarist could take 
the process even further by recording his or her MIDI guitar both directly and 
to a sequenced MIDI track. In this way, the reamp and patch combinations 
would be virtually unlimited. 

MICROPHONE PLACEMENT TECHNIQUES 

The following sections are meant to be used as a general guide to mic placement 
for various acoustic and popular instruments. It's important to keep in mind 
that these are only guidelines. Several general application and characteristic 
notes are detailed in Table 4.1, and descriptions of several popular mies are 
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Table 4.1 Microphone selection guidelines. 

Needed Application 

Required Microphone Choice and/or Characteristic 

Natural, smooth tone quality 

Fiat frequeney response 

Bright, present tone quality 

Rising frequeney response 

Extended lows 

Dynamic or condenser with extended low-frequency 
response 

Extended highs (detailed sound) 

Condenser 

Increased “edge” or midrange detail 

Dynamic 

Extra ruggedness 

Dynamic or modern ribbon/condenser 

Boosted bass at cióse working 
distances 

Directional microphone 

Fiat bass response up cióse 

Omnidirectional microphone 

Reduced leakage, feedback, and 
room acoustics 

Directional microphone, or omnidirectional 
microphone at cióse working distances 

Enhanced pickup of room acoustics 

Place microphone or stereo pair at greater working 
distances 

Reduced handling noise 

Omnidirectional, vocal microphone, or directional 
microphone with shock mount 

Reduced breath popping 

Omnidirectional or directional microphone with pop filter 

Distortion-free pickup of very loud 
sounds 

Dynamic or condenser with high máximum SPL rating 

Noise-free pickup of quiet sounds 

Condenser with low self-noise and high sensitivity 


outlined toward the end in the "Microphone Selection" section to help give 
insights into placement and techniques that might work best in a particular 
application. 

As a general rule, choosing the best mic for an instrument or vocal will ulti- 
mately depend on the sound you're searching for. For example, a dynamic mic 
will often yield a "rugged" or "punchy" character (which is often further accen- 
tuated by the proximity of bass boost that's generally associated with a direc- 
tional mic). A ribbon mic will often yield a mellow sound that ranges from 
being open and clear to slightly "croony" ... depending on the type and dis- 
tances involved. Condenser mies are often characterized as having a clear, 
present and full-range sound that varíes with mic design, grill options and 
capsule size. Before jumping into this section, I'd like to again take time to point 
out the "Good Rule" to anyone who wants to be a better engineer, producer 
and/or musician: 
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As a rule, starting with an experienced, rehearsed and ready musician who has 
a quality instrument that's well tuned is the best insurance toward getting the 
best possible sound. Let's think about this for a moment. Say that we have a 
live rhythm session that involves drums, piano, bass guitar and scratch vocals. 
All of the players are the best around, except for the drummer, who is new to 
the studio process. Unfortunately, you've now signed on to teach the drummer 
the ropes of proper drum tuning, studio interaction and playing under pressure. 
It goes without saying that the session might go far less smoothly than it 
otherwise would, as you'll have to take the extra time to work with the player 
to get the best possible sound. Once you're rolling, it'll also be up to you or 
the producer to pulí a professional performance out of someone who's new to 
the field. 

Don't get me wrong, musicians have to start somewhere ... but an experienced 
studio musician who comes into the studio with a great instrument that's tuned 
and ready to go (and who might even clue you in on some sure-fire mic and 
placement techniques for the instrument) is simply a joy from a sound, perfor¬ 
mance, time and budget-saving standpoint. Simply put, if you and/or the proj- 
ect's producer have prepared enough to get all your "goods" lined up, the track 
will have a much better chance of being something that everyone can be proud 
of. Just as with the art of playing an instrument, careful mic choice, placement 
and "style" in the studio are also subjective ... and are a few of the fundamen¬ 
tal calling cards of a good engineer. Experience simply comes with time and the 
willingness to experiment. Be patient, learn, listen and have fun ... and you too 
will eventually rise to the professional occasion. 

Brass instruments 

The following sections describe many of the sound characteristics and miking 
techniques that are encountered in the brass family of instruments. 

TRUMPET 

The fundamental frequency of a trumpet ranges from E3 to D6 (165 to 1175 Hz) 
and contains overtones that stretch upward to 15 kHz. Below 500 Hz, the 
sounds emanating from the trumpet project uniformly in all directions; above 
1500 Hz, the projected sounds become much more directional; and above 
5 kHz, the dispersión emanates at a tight 30° angle from in front of the bell. 
The formants of a trumpet (the relative harmonic and resonance frequencies 
that give an instrument its specific character) lie at around 1 to 1.5 kHz and at 
2 to 3 kHz. Its tone can be radically changed by using a mute (a cup-shaped 
dome that fits directly over the bell), which serves to dampen frequencies above 
2.5 kHz. A conical mute (a metal mute that fits inside the bell) tends to cut 
back on frequencies below 1.5 kHz while encouraging frequencies above 4 kHz. 
Because of the high sound-pressure levels that can be produced by a trumpet 
(up to 130 dB SPL), it's best to place a mic slightly ofif the bell's center at a 
distance of 1 foot or more (Figure 4.45). When closer placements are needed, 
a -10- to -20-dB pad can help prevent input overload at the mic or consolé 
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preamp input. Under such cióse working condi- 
tions, a windscreen can help protect the dia- 
phragm from windblasts. 

TROMBONE 

Trombones come in a number of sizes; however, 
the most commonly used "bone" is the tenor, 
which has a fundamental note range spanning 
from E2 to C5 (82 to 523 Hz) and produces a 
series of complex overtones that range from 
5 kHz (when played médium loud) to 10 kHz 
(when overblown). The trombones polar pattern 
is nearly as tight as the trumpet's: Frequencies 
below 400 Hz are distributed evenly, whereas its 
dispersión angle increases to 45 ° from the bell 
at 2 kHz and above. The trombone most often 
appears in jazz and classical music. The Mass in 
C Minor by Mozart, for example, has parts for 
soprano, alto, tenor and bass trombones. This 
style obviously lends itself to the spacious blend- 
ing that can be achieved by distant pickups 
within a large hall or studio. On the other hand, 
jazz music often calis for closer miking distances. At 2 to 12 inches, for example, 
the trombonist should play slightly to the side of the mic to reduce the chance 
of overload and wind blasts. In the miking of a trombone section, a single mic 
might be placed between two players, acoustically combining them onto a 
single channel and/or track. 

TUBA 

The bass and double-bass tubas are the lowest pitched of the brass/wind instru- 
ments. Although the bass tuba's range is actually a fifth higher than the double 
bass, it's still possible to obtain a low fundamental of B (31 Hz). A tuba's over- 
tone structure is limited; it's top response ranges from 1.5 to 2 kHz. The lower 
frequencies (around 75 Hz) are evenly dispersed; however, as frequencies rise, 
their distribution angles reduce. Under normal conditions, this class of instru- 
ments isn't miked at cióse distances. A working range of 2 feet or more, slightly 
off-axis to the bell, will generally yield the best results. 

FRENCH HORN 

The fundamental tones of the French horn range from B1 to B5 (62 to 700 Hz). 
Its "oo" formant gives it a round, broad quality that can be found at about 
340 Hz, with other frequencies falling between 750 Hz and 3.5 kHz. French 
horn players often place their hands inside the bell to mute the sound and 
promote a formant at about 3 kHz. A French horn player or section is tradition- 
ally placed at the rear of an ensemble, just in front of a rear, reflective stage wall. 




FIGURE 4.45 

Typical microphone 
placement for a 
single trumpet. 













Microphone Placement echniques 


This wall serves to reflect the sound back toward the listener's position (which 
tends to create a fuller, more defined sound). An effective pickup of this instru- 
ment can be achieved by placing an omni- or bidirectional pickup between the 
rear, reflecting wall and the instrument bells, thereby receiving both the direct 
and reflected sound. Alternatively, the pickups can be placed in front of 
the players, thereby receiving only the sound that's being reflected from the 
rear wall. 

Guitar 

The following sections describe the various sound characteristics and techniques 
that are encountered when miking the guitar. 

ACOUSTIC GUITAR 

The popular steel-strung, acoustic guitar has a bright, rich set of overtones 
(especially when played with a pick). Mic placement and distance will often 
vary from instrument to instrument and may require experimentation to pick 
up the best tonal balance. A balanced pickup can often be achieved by placing 
the mic (or an X/Y stereo pair) at a point slightly off-axis and above or below 
the sound hole at a distance of between 6 inches and 1 foot (Figure 4.46). 
Condenser mies are often preferred for their smooth, extended frequeney 
response and excellent transient response. The smaller-bodied classical guitar is 
normally strung with nylon or gut and is played with the fingertips, giving it a 
warmer, mellower sound than its steel-strung counterpart. To make sure that 
the instrument's ftill range is picked up, place the mic closer to the center of 
the bridge, at a distance of between 6 inches and 1 foot. 

Miking near the sound hole 

The sound hole (located at the front face of a guitar) serves as a bass port, which 
resonates at the lower frequencies (around 80 to 100 Hz). Placing a mic too 


FIGURE 4.46 

Typical microphone 
placement for the 
guitar. 
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cióse to the front of this port might result in a boomy and unnatural sound; 
however, miking cióse to the sound hole is often popular on stage or around 
high acoustic levels because the guitar's output is highest at this position. To 
achieve a more natural pickup under these conditions, the microphone's output 
can be rolled off at the lower frequencies (5 to 10 dB at 100 Hz). 

Surround guitar miking 

An effective way to transíate an acoustic guitar to the wide stage of surround (if 
a big, full sound is what you're after) is to record the guitar using X/Y or spaced 
techniques stereo (panned front L/R) ... and pan the guitar's electric pickup (or 
added contact pickup) to the rear center of the surround field. Extra ambient 
surround mies can also be used in an all-acoustic session. 


THE ELECTRIC GUITAR 

The fundamentáis of the average 22-fret guitar extend from E2 to D6 (82 to 
1174 Hz), with overtones that extend much higher. All of these frequencies 
might not be amplified, because the guitar chord tends to attenuate frequencies 
above 5 kHz (unless the guitar has a built-in low impedance converter or low- 
impedance pickups). The frequeney limitations of the average guitar loud- 
speaker often add to this effect, because their upper limit is generally restricted 
to below 5 or 6 kHz. 


Miking the guitar amp 

The most popular guitar amplifier used for recording is a small practice-type 
amp/speaker system. These high-quality amps often help the guitar's suffering 
high end by incorporating a sharp rise in the response range at 4 to 5 kHz, thus 
helping to give it a clean, open sound. High-volume, wall-of-sound speaker 
stacks are less commonly used in a session, because they're harder to control 
in the studio and in a mix. By far the most popular mic type for picking up an 
electric guitar amp is the cardioid dynamic. A dynamic tends to give the sound 
a full-bodied character without picking up extraneous amplifier noises. Often 
guitar mies will have a pronounced presence peak in the upper frequeney range, 
giving the pickup an added clarity. For increased separation, a 
microphone can be placed at a working distance of 2 inches to 1 
foot. When miking at a distance of less than 4 inches, mic/speaker 
placement becomes slightly more critical (Figure 4.47). For a 
brighter sound, the mic should face directly into the center of the 
speaker's cone. Placing it off the cone's center tends to produce a 
more mellow sound while reducing amplifier noise. 

Isolation cabinets have also come onto the market that are 
literally sealed boxes that house a speaker or guitar amp/cabinet 
system, as well as an internal mic mount. These systems are 
used to reduce leakage and to provide greater control over instru- 
ment levels within a recording studio or control room during a 


FIGURE 4.47 

Miking an electric 
guitar cabinet directly 
in front of and 
off-center to the 
cone. 
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FIGURE 4.48 

Direct recording of 
an electric guitar: 
(a) direct recording; 


Recording direct 

A DI box is often used to feed the output signal 
of an electric guitar directly into the mic input 
stage of a recording consolé or mixer. By routing 
the direct output signal to a track, a cleaner, 
more present sound can be recorded (Figure 
4.48a). This technique also reduces the leakage 
that results from having a guitar amp in the 
studio and even makes it possible for the guitar 
to be played in the control room or project 
studio. A combination of direct and miked 
signáis often results in a sound that adds the 
characteristic fullness of a miked amp to the 
extra "bite" that a DI tends to give. These may be 
combined onto a single track or, whenever pos¬ 
sible, can be assigned to sepárate tracks, allowing 
for greater control during mixdown (Figure 
4.48b). During an overdub, the ambient image 
can be "opened up" even further by mixing a 
semidistant or distant mic (or stereo pair) with 
the direct mic (and even with the cióse miked 
amp signal). This ambient pickup can be either 
mixed into a stereo field or at the rear of a sur- 
round field to fill out the sound. 


(b) combined direct THE ELECTRIC BASS GUITAR 
and miked signal. 


The fundamentáis of an electric bass guitar range from about El to F4 (41.2 to 
343.2 Hz). If it's played loudly or with a pick, the added harmonics can range 
upward to 4 kHz. Playing in the "slap" style or with a pick gives a brighter, 
harder attack, while a "fingered" style will produce a mellower tone. In modern 
music production, the bass guitar is often recorded direct for the cleanest pos¬ 
sible sound. As with the electric guitar, the electric bass can be either miked at 
the amplifier or picked up through a DI box. If the amp is miked, dynamic mies 
usually are chosen for their deep, rugged tones. The large-diaphragm dynamic 
designs tend to subdue the high-frequeney transients. When combined with a 
boosted response at around 100 Hz, these large diaphragm dynamics give a 
warm, mellow tone that adds power to the lower register. Equalizing a bass can 
sometimes increase its clarity, with the fundamental being affected from 125 to 
400 Hz and the harmonic punch being from 1.5 to 2 kHz. A compressor is 
commonly used on electric and acoustic basses. It's a basic fact that the signal 
output from the instrument's notes often varíes in level, causing some notes to 
stand out while others dip in volume. A compressor having a smooth input/ 
output ratio of roughly 4:1, a fast attack (8 to 20 milliseconds), and a slower 
release time (X t° X second) can often smooth out these levels, giving the 
instrument a strong, present and smooth bass line. 
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Keyboard instruments 

The following sections describe the various sound characteristics and techniques 
that are encountered when miking keyboard instruments. 

GRAND PIANO 

The grand piano is an acoustically complex instrument that can be miked in a 
variety of ways, depending on the style and preferences of the artist, producer 
and/or engineer. The overall sound emanates from the instrument's strings, 
soundboard and mechanical hammer system. Because of its large surface area, 
a minimum miking distance of 4 to 6 feet is needed for the tonal balance to 
fully develop and be picked up; however, leakage from other instruments often 
means that these distances aren't practical or possible. As a result, pianos are 
often miked at distances that favor such instrument parís as: 

■ Strings and soundboard, often yielding a bright and relatively natural tone 

■ Hammers, generally yielding a sharp, percussive tone 

■ Soundboard boles alone, often yielding a sharp, full-bodied sound. 

In modern music production, two basic grand piano styles can be found in the 
recording studio: the concert grand, which traditionally has a rich and full- 
bodied tone (often used for classical music and ranging in size up to 9 feet in 
length), and the studio grand, which is more suited for modern music produc¬ 
tion and has a sharper, more percussive edge to its tone (often being about 7 
feet in length). 

Figure 4.49 shows a number of miking positions that can be used in recording 
a grand piano. Although several mic positions are illustrated, it's important to 



FIGURE 4.49 

Possible miking 
combinations for the 
grand piano. 
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keep in mind that these are only guidelines from which to begin. Your own 
personal sound can be achieved through mic choice and experimentation with 
mic placement. 

Position 1: The mic is attached to the partially or entirely open lid of the piano. 
The most appropriate choice for this pickup is the boundary mic, which 
can be permanently attached or temporarily taped to the lid. This method 
uses the lid as a collective reflector and provides excellent pickup under 
restrictive conditions (such as on stage and during a live video shoot). 

Position 2: Two mies are placed in a spaced stereo configuration at a working 
distance of 6 inches to 1 inch. One mic is positioned over the low strings 
and one is placed over the high strings. 

Position 3: A single mic or coincident stereo pair is placed just inside the 
piano between the soundboard and its fully or partially open lid. 

Position 4: A single mic or stereo coincident pair is placed outside the piano, 
facing into the open lid (this is most appropriate for solo or accent 
miking). 

Position 5: A spaced stereo pair is placed outside the lid, facing into the 
instrument. 

Position 6: A single mic or stereo coincident pair is placed just over the piano 
hammers at a working distance of 4 to 8 inches to give a driving pop or 
rock sound. 

A condenser or extended-range dynamic mic is most often the preferred choice 
when miking an acoustic grand piano, as those types of mies tend to accurately 
represent the transient and complex nature of the instrument. Should excessive 
leakage be a problem, a close-miked cardioid (or cardioid variation) can be 
used; however, if leakage isn't a problem, backing away to a compromise dis¬ 
tance (3 to 6 feet) can help capture the instrument's overall tonal balance. 

Separation 

Separation is often a problem associated with the grand piano whenever it is 
placed next to noisy neighbors. Separation, when miking a piano, can be 
achieved in the following ways: 

■ Place the piano inside a sepárate isolation room. 

■ Place a fíat (acoustic separator) between the piano and its louder neighbor. 

■ Place the mies inside the piano and lower the lid onto its short stick. A 
heavy moving or other type of blanket can be placed over the lid to further 
reduce leakage. 

■ Overdub the instrument at a later time. In this situation, the lid can be 
removed or propped up by the long stick, allowing the mies to be placed 
at a more natural-sounding distance. 

UPRIGHT PIANO 

You would expect the techniques for this seemingly harmless piano type to be 
similar to those for its bigger brother. This is partially true. However, because 


Microphones: Design and Application 1APT R O 





FIGURE 4.50 

One possible pickup 
combination places 
the mies over the top 
of an upright piano. 


this instrument was designed for home enjoyment and not performance, the 
mic techniques are often very different. Since it's often more difficult to achieve 
a respectable tone quality when using an upright, you might want to try the 
following methods (Figure 4.50): 

■ Miking over the top: Place two mies in a spaced fashion just over and in 
front of the piano's open top, with one over the bass strings and one over 
the high strings. If isolation isn't a factor, remove or open the front face 
that covers the strings in order to reduce reflections and, therefore, the 
instrument's characteristic "boxy" quality. Also, to reduce resonances you 
might want to angle the piano out and away from any walls. 

■ Miking the kickboard area: For a more natural sound, remove the kickboard 
at the lower front part of the piano to expose the strings. Place a stereo 
spaced pair over the strings (one each at a working distance of about 8 
inches over the bass and high strings). If only one mic is used, place it 
over the high-end strings. Be aware, though, that this placement can pick 
up excessive foot-pedal noise. 

■ Miking the upper soundboard area: To reduce excessive hammer attack, place 
a microphone pair at about 8 inches from the soundboard, above both 
the bass and high strings. In order to reduce muddiness, the soundboard 
should be facing into the room or be moved away from nearby walls. 

ELECTRONIC KEYBOARD INSTRUMENTS 

Signáis from most electronic instruments (such as synthesizers, samplers and 
drum machines) are often taken directly from the device's line-level output(s) 
and inserted into a consolé, either through a DI box or directly into a channel's 
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FIGURE 4.51 

A Leslie speaker 
cabinet creates a 
unique vibrato effect 
by using a set of 
rotating speaker 
baffles that spin on a 
horizontal axis, (a) 
Miking the rotating 
speakers of a Leslie 
cabinet; (b) modern 
portable rotary amp 
with built-in 
microphones and 
three XLR outputs. 
(Courtesy of Motion 
Sound, www. 
motion-sound.com.) 


line-level input. Alternatively, the keyboard's output can be plugged directly into 
the recorder or interface line-level inputs. The approach to miking an electronic 
organ can be quite different from the techniques just mentioned. A good 
Hammond or other older organ can sound wonderfully "dirty" through miked 
loudspeakers. Such organs are often played through a Leslie cabinet (Figure 
4.51), which adds a unique, Doppler-based vibrato. Inside the cabinet is a set 
of rotating speaker baffles that spin on a horizontal axis and, in turn, produce 
a pitch-based vibrato as the speakers accelerate toward and away from the mies. 
The upper high-frequeney speakers can be picked up by either one or two mies 
(each panned left and right), with the low-frequency driver being picked up by 
one mic. Motor and baffle noises can produce quite a bit of wind, possibly 
creating the need for a windscreen and/or experimentation with placement. 



Percussion 

The following sections describe the various sound characteristics and techniques 
that are encountered when miking drums and other percussion instruments. 

DRUMSET 

The standard drum kit (Figure 4.52) is often at the foundation of modern music, 
because it provides the "heartbeat" of a basic rhythm track; consequently, a 
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FIGURE 4.52 

Peter Erskine’s studio 
drum kit. (Courtesy 
of Beyerdynamic, 
www.beyerdynamic. 
com.) 


proper drum sound is extremely important to the outcome of most music 
projects. Generally, the drum kit is composed of the kick drum, snare drum, 
high-toms, low-tom (one or more), hi-hat and a variety of cymbals. Since a full 
kit is a series of interrelated and closely spaced percussion instruments, it often 
takes real skill to transíate the proper spatial and tonal balance into a project. 
The larger-than-life driving sound of the acoustic rock drum set that we've all 
become familiar with is the result of an expert balance among playing tech- 
niques, proper tuning and mic placement. 

During the past several decades, drums have undergone a substantial change 
with regard to playing technique, miking technique and choice of acoustic 
recording environment. In the 1960s and 1970s, the drum set was placed in a 
small isolation room called a drum booth. This booth acoustically isolated the 
instrument from the rest of the studio and had the effect of tightening the drum 
sound because of the limited space (and often dead acoustics). The drum booth 
also physically isolated the musician from the studio, which often caused the 
musician to feel removed and less involved in the action. Today, many engineers 
and producers have moved the drum set out of smaller iso-rooms and back into 
larger open studio areas where the sound can fully develop and combine with 
the studio's own acoustics. In many cases, this effect can be exaggerated by 
placing a distant mic pair in the room (a technique that often produces a fuller, 
larger-than-life sound, especially in surround). 

Before a session begins, the drummer should tune each drum while the mies 
and baffles for the other instruments are being set up. Each drumhead should 
be adjusted for the desired pitch and for constant tensión around the rim by 
hitting the head at various points around its edge and adjusting the lugs for the 
same pitch all around the head. Once the drums are tuned, the engineer should 
listen to each drum individually to make sure that there are no buzzes, rattles, 
or resonant after-rings. Drums that sound great in live performance may not 
sound nearly as good when being cióse miked. In a live performance, the rattles 
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and rings are covered up by the other instruments and are lost before the sound 
reaches the listener. Cióse miking, on the other hand, picks up the noises as 
well as the desired sound. 

If tuning the drums doesn't bring the extraneous noises or rings under control, 
duct or masking tape can be used to dampen them. Pieces of cloth, dampening 
rings, paper towels, or a wallet can also be taped to a head in various locations 
(which is determined by experimentation) to eliminate rings and buzzes. 
Although head damping has been used extensively in the past, present methods 
use this damping technique more discreetly and will often combine dampening 
with proper design and tuning styles (all of which are the artist's personal cali). 

During a session, it's best to remove the damping mechanisms that are built 
into most drum sets, because they apply tensión to only one spot on the head 
and unbalance its tensión. These built-in dampeners often vibrate when the 
head is hit and are a chief source of rattles. Removing the front head and placing 
a blanket or other damping material inside the drum (so that it's pressing 
against the head) can often dampen the kick drum. Adjusting the amount of 
material can vary the sound from being a resonant boom to a thick, dull thud. 
Kick drums are usually (but not always) recorded with their front heads removed, 
while other drums are recorded with their bottom heads either on or off. Tuning 
the drums is more difficult if two heads are used because the head tensions 
often interact; however, they will often produce a more resonant tone. After the 
drums have been tuned, the mikes can be put into position. It's important to 
keep the mies out of the drummer's way, or they might be hit by a stick or 
moved out of position during the performance. 

Miking the drum set 

After the drum set has been optimized for the best sound, the mies can be placed 
into their pickup positions (Figure 4.53). Because each part of the drum set is 
so different in sound and function, it's often best to treat each grouping as an 
individual instrument. In its most basic form, the best place to start when 
miking a drum set is to start with the fundamental "groups." These inelude 
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FIGURE 4.53 

Typical microphone placements for a drum set: (a) side view; (b) front view; (c) top view. 
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placing a mic on the kick (1) and on the snare drum (4). At an absolute 
mínimum, the entire drum set can be adequately picked up using only four 
mies by adding two overhead pickups, either spaced (3) or coincident (4). In 
fact, this "bare bones" placement was (and continúes to be) commonly used 
on many elassie jazz recordings. If more tracks are available (or required), addi- 
tional mies can be placed on the various toms, hi-hat and even individual 
cymbals. 

A mic's frequeney response, polar response, proximity effect and transient 
response should be taken into account when matching it to the various drum 
groups. Dynamic range is another important consideration when miking drums. 
Since a drum set is capable of generating extremes of volume and power (as 
well as softer, more subtle sounds), the chosen mies must be able to withstand 
strong peaks without distorting, and yet still be able to capture the more delicate 
nuances of a sound. 

Since the drum set usually is one of the loudest sound sources in a studio 
setting, it's often wise to place it on a solidly supported riser. This reduces 
the amount of low-end "thud" that can otherwise leak through the floor into 
other parts of the studio. Depending on the studio layout, the following drum 
scenarios may occur: 

■ The drums could be placed in their own room, isolated from other 
instruments. 

■ To achieve a bigger sound, the drums could be placed in the large studio 
room while the other instruments are placed in smaller iso-rooms or are 
recorded direct. 

■ To reduce leakage, the drums could be placed in the studio, while being 
enclosed by 4-foot (or higher) divider fíats. 

Kick drum 

The kick drum adds a low-energy drive or "punch" to a rhythm groove. This 
drum has the capability to produce low frequencies at high sound-pressure 
levels, so it's necessary to use a mic that can both handle and faithfully repro¬ 
duce these signáis. Often the best choice for the job is a large-diaphragm 
dynamic mic. Since proximity effect (bass boost) occurs when using a direc- 
tional mic at cióse working distances and because the drum's harmonics vary 
over its large surface area, even a minor change in placement can have a pro- 
found effect on the pickup's overall sound. Moving the mic closer to the head 
(Figure 4.54) can add a degree of warmth and fullness, while moving it farther 
back often emphasizes the high-frequeney "click." Placing the mic closer to the 
beater emphasizes the hard "thud" sound, whereas an off-center pickup cap¬ 
tures more of the drum's characteristic skin tone. A dull and loose kick sound 
can be tightened to produce a sharper, more defined transient sound by placing 
a blanket or other damping material inside the drum shell firmly against the 
beater head. Cutting back on the kick's equalization at 300 to 600 Hz can help 
reduce the dull "cardboard" sound, whereas boosting from 2.5 to 5 kHz adds 
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FIGURE 4.54 

Placing the micro¬ 
phone at a distance 
just outside the kick 
drumhead to bring 
out the low end and 
natural fullness. 



FIGURE 4.55 

Typical microphone 
positioning for the 
snare drum. 



a sharper attack, "click" or "snap." It's also often a good idea to have a can of 
WD-40® or other light oil handy in case squeaks from some of the moving parís 
(most often the kick pedal) gets picked up by the mies. 

Snare drum 

Commonly, a snare mic is aimed just inside the top rim of the snare drum at 
a distance of about 1 inch (Figure 4.55). The mic should be angled for the best 
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possible separation from other drums and cymbals. Its rejection angle should 
be aimed at either the hi-hat or rack-toms (depending on leakage difficulties). 
Usually, the mic's polar response is cardioid, although bidirectional and super- 
cardioid responses might offer a tighter pickup angle. With certain musical styles 
(such as jazz), you might want a crisp or "bright" snare sound. This can be 
achieved by placing an additional mic on the snare drum's bottom head and 
then combining the two mies onto a single track. Because the bottom snare 
head is 180° out of phase with the top, it's generally a wise idea to reverse the 
bottom mic's phase polarity. When playing in styles where the snare springs are 
turned off, it's also wise to keep your ears open for snare rattles and buzzes that 
can easily leak into the snare mic (as well as other mies). The continued ringing 
of an "open" snare note (or any other drum type, for that matter) can be damp- 
ened in several ways. Dampening rings, which can be purchased at music stores, 
are used to reduce the ring and to deepen the instrument's tone. If there are 
no dampening rings around, the tone can be dampened by taping a billfold 
or similar-sized folded paper towel to the top of a drumhead, a few inches off 
its edge. 


Overheads 


Overhead mies are generally used to pick up the high-frequeney transients of 
cymbals with crisp, accurate detail while also providing an overall blend of the 
entire drum kit. Because of the transient nature of cymbals, a condenser mic is 
often chosen for its accurate high-end response. Overhead mic placement can 
be very subjective and personal. One type of placement is the spaced pair, 
whereby two mies are suspended above the left and right sides of the kit. These 
mies are equally distributed about the L/R cymbal clusters so as to pick up their 
respective instrument components in abalanced fashion (Figure 4.56a). Another 


FIGURE 4.56 

Typical stereo 
overhead pickup 
positions: (a) spaced 
pair technique; 

(b) X/Y coincident 
technique. 
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placement method is to suspend the mies closely together in a coincident 
fashion (Figure 4.56b). This often yields an excellent stereo overhead image 
with a minimum of the phase cancellations that might otherwise result when 
using spaced mies. Again, it's important to remember that there are no rules for 
getting a good sound. If only one overhead mic is available, place it at a central 
point over the drums. If you're using a number of pickups to cióse mic indi¬ 
vidual components of a kit, there might be times when you won't need over- 
heads at all (the leakage spillover just might be enough to do the trick). 

Rack-toms 

The upper rack-toms can be miked either individually (Figure 4.57) or by 
placing a single mic between the two at a short distance (Figure 4.58). When 
miked individually, a "dead" sound can be achieved by placing the mic cióse 
to the drum's top head (about 1 inch above and 1 to 2 inches in from the outer 
rim). A sound that's more "live" can be achieved by increasing the height above 
the head to about 3 to 6 inches. If isolation or feedback is a consideration, a 
hypercardioid pickup pattern can be chosen. Another way to reduce leakage and 
to get a deep, driving tone (with less attack) is to remove the tom's bottom head 
and place the mic inside, 1 to 6 inches away from the top head. 

Floor-tom 

Floor-toms can be miked similarly to the rack-toms (Figure 4.59). The mic can 
be placed 2 to 3 inches above the top and to the side of the head, or it can be 
placed inside 1 to 6 inches from the head. Again, a single mic can be placed 
above and between the two floor-toms, or each can have its own mic pickup 
(which often yields a greater degree of control over panning and tonal color). 



FIGURE 4.57 

Individual miking of a 
rack-tom. 
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FIGURE 4.58 

Single microphone 
placement for picking 
up two toms. 



FIGURE 4.59 

Typical microphone 
placement for the 
floor-tom. 


Hi-hat 

The "hat" usually produces a strong, sibilant energy in the high-frequency range, 
whereas the snare's frequencies often are more concentrated in the midrange. 
Although moving the hat's mic won't change the overall sound as much as it 
would on a snare, you should still keep the following three points in mind: 

■ Placing the mic above the top cymbal will help pick up the nuances of 
sharp stick attacks. 

■ The open and closing motion of the hi-hat will often produce rushes of 
air; consequently, when miking the hat's edge, angle the mic slightly above 
or below the point where the cymbals meet. 
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■ If only one mic is available (or desired), both the snare and hi-hat can be 
simultaneously picked up by carefully placing the mic between the two, 
facing away from the rack-toms as much as possible. Alternatively, a 
figure-8 mic can be placed between the two with the nuil axis facing 
toward the cymbals and the kick. 

TUNED PERCUSSION INSTRUMENTS 

The following sections describe the various sound characteristics and techniques 
that are encountered when miking tuned percussion instruments. 

Congas and hand drums 

Congas, tumbas and bongos are single-headed, low-pitched drums that can be 
individually miked at very cióse distances of 1 to 3 inches above the head and 
2 inches in from the rim, or the mies can be pulled back to a distance of 1 foot 
for a fuller, "Uve" tone. Alternatively, a single mic or X/Y stereo pair can be 
placed at a point about 1 foot above and between the drums (which are often 
played in pairs). Another class of single-headed, low-pitched drums (known as 
hand drums) isn't necessarily played in pairs but is often held in the lap or 
strapped across the player's front. Although these drums can be as percussive as 
congas, they're often deeper in tone and often require that the mic(s) be backed 
off in order to allow the sound to develop and/or fully interact with the room. 
In general, a good pickup can be achieved by placing a mic at a distance of 1 
to 3 feet in front of the hand drum's head. Since a large part of the drum's sound 
(especially its low-end power) comes from its back hole, another mic can be 
placed at the lower port at a distance of 6 inches to 2 feet. Since the rear sound 
will be 180° out of phase from the front pickup, the mic's phase should be 
reversed whenever the two signáis are combined. 

Xylophone, vibraphone and marimba 

The most common way to mic a tuned percussion instrument is to place two 
high-quality condenser or extended-range dynamic pickups above the playing 
bars at a spaced distance that's appropriate to the instrument size (following 
the 3 :1 general rule). A coincident stereo pair can help eliminate possible phase 
errors; however, a spaced pair will often yield a wider stereo image. 

Stringed instruments 

Of all the instrumental families, stringed instruments are perhaps the most 
diverse. Ethnic music often uses instruments that range from being single 
stringed to those that use highly complex and developed Systems to produce 
rich and subtle tones. Western listeners have grown accustomed to hearing the 
violin, viola, cello and double bass (both as solo instruments and in an ensem¬ 
ble setting). Whatever the type, stringed instruments vary in their design type 
and in construction to enhance or cut back on certain harmonic frequencies. 
These variations are what give a particular stringed instrument its own charac- 
teristic sound. 
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VIOLIN AND VIOLA 

The frequency range of the violin runs from 196 Hz 
to above 10 kHz. For this reason, a good mic that 
displays a relatively fíat frequency response should 
be used. The violin's fundamental range is from G3 
to E6 (196 to 1300 Hz), and it is particularly impor- 
tant to use a mic that's fíat around the formant fre- 
quencies of 300 Hz, 1 kHz, and 1200 Hz. The 
fundamental range of the viola is tuned a fifth lower 
and contains fewer harmonic overtones. In most 
situations, the violin or viola's mic should be placed 
within 45° of the instrument's front face. The dis- 
tance will depend on the particular style of music and 
the room's acoustic condition. Miking at a greater 
distance will generally yield a mellow, well-rounded 
tone, whereas a closer position might yield a scratchy, 
more nasal quality ... the choice will depend on the 
instrument's tone quality. The recommended miking 
distance for a solo instrument is between 3 and 8 
feet, over and slightly in front of the player (Figure 

4.60). Under studio conditions, a closer mic distance of between 2 and 3 feet 
is recommended. For a fiddle or jazz/rock playing style, the mic can be placed 
at a cióse working distance of 6 inches or less, as the increased overtones help 
the instrument to cut through an ensemble. Under PA (public address) applica- 
tions, distant working conditions are likely to produce feedback (since less 
amplification is needed). In this situation, an electric pickup, contact, or clip- 
type microphone can be attached to the instrument's body or tailpiece. 



FIGURE 4.60 

Example of a typical 
microphone 
placement for the 
violin. 


CELLO 

The fundamental range of the cello is from C2 to C5 (56 to 520 Hz), with 
overtones up to 8 kHz. If the player's line of sight is taken to be 0 o , then the 
main direction of sound radiation lies between 10° and 45° to the right. A 
quality mic can be placed level with the instrument and directed toward the 
sound holes. The chosen microphone should have a fíat response and be placed 
at a working distance of between 6 inches and 3 feet. 


DOUBLEBA55 

The double bass is one of the orchestra's lowest-pitched instruments. The fun¬ 
damentáis of the four-string type reach down to El (41 Hz) and up to around 
middle C (260 Hz). The overtone spectrum generally reaches upward to 7 kHz, 
with an overall angle of high-frequency dispersión being ±15 ° from the player's 
line of sight. Once again, a mic can be aimed at the / holes at a distance of 
between 6 inches and 1.5 feet. 













Microphone Placement echniques 


Voice 

From a shout to a whisper, the human voice is a talented and versatile sound 
source that displays a dynamic and timbrel range that's matched by few other 
instruments. The male bass voice can ideally extend from E2 to D4 (82 to 
294 Hz) with sibilant harmonics extending to 1 2 kHz. The upper soprano voice 
can range upward to 1050 Hz with harmonics that also climb to 12 kHz. 

When choosing a mic and its proper placement, it's important to step back for 
a moment and remember that the most important "device" in the signal chain 
is the vocalist. Let's assume that the engineer/producer hasn't made the classic 
mistake of waiting until the last minute (when the project goes over budget 
and/or into overtime) to record the vocals. ... Good, now the vocalist can relax 
and concéntrate on a memorable performance. Next step is to concéntrate on 
the vocalist's "creature comforts": How are the lighting and temperature set- 
tings? Is the vocalist thirsty? Once done, you can go about the task of choosing 
your mic and its placement to best capture the performance. 

The engineer/producer should be aware of the following traps that are often 
encountered when recording the human voice: 

■ Excessive dynamic range: This can be solved either by mic technique (phys- 
ically moving away from the mic during louder passages) or by inserting 
a compressor into the signal path. Some vocalists have dynamics that 
range from whispers to normal volumes to practically screaming . . . all 
in a single passage. If you optimize your recording levels during a mod- 
erate-volume passage and the singer begins to belt out the lines, then the 
levels will become too "hot" and will distort. Conversely, if you set your 
recording levels for the loudest passage, the modérate volumes will be 
buried in the music. The solution to this dilemma is to place a compres- 
sor in the mies signal path. The compressor automatically "rides" the 
signal's gain and reduces excessively loud passages to a level that the 
system can effectively handle. (See Chapter 12 for more information about 
compression and devices that alter dynamic range.) 

■ Sibilance: This occurs when sounds such as /, s and sh are overly accentu- 
ated. This often is a result of tape saturation and distortion at high levels 
or slow tape speeds. Sibilance can be reduced by inserting a frequeney- 
selective compressor (known as a de-esser) into the chain or through the 
use of modérate equalization. 

■ Excessive bass boost due to proximity effect: This bass buildup often occurs 
when a directional mic is used at cióse working ranges. It can be reduced 
or compensated for by increasing the working distance between the source 
and the mic, by using an omnidirectional mic (which doesn't display a 
proximity bass buildup), or through the use of equalization. 

MIC TOOLS FOR THE VOICE 

Some of the most common tools in miking are used for fixing problems that 
relate to picking up the human voice and to room isolation. 
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Explosive popping p and b sounds often result when turbulent air blasts 
from the mouth strike the mic diaphragm. This problem can be avoided or 
reduced by: 

■ Placing a pop filter over the mic 

■ Placing a mesh windscreen between the mic and the vocalist 

■ Taping a pencil in front of the mic capsule, so as to break up the "plosive" 
air blasts 

■ Using an omnidirectional mic (which is less sensitive to popping, but 
might cause leakage issues). 

Reducing problems due to leakage and inadequate isolation can be handled in 
any number of situational ways, including: 

■ Choice of directional pattern (i.e., choosing a tighter cardioid or hyper- 
cardioid pattern can help reduce unwanted leakage) 

■ Isolating the singer with a fíat or portable isolation device 

■ Isolating the singer in a sepárate iso-booth 

■ Overdubbing the vocals at a later time, keeping in mind that carefully 
isolated "scratch" vocals can help glue the band together and give the 
vocalist a better feel for the song. 

Woodwind instruments 

The flute, clarinet, oboe, saxophone and bassoon combine to make up the 
woodwind class of instruments. Not all modern woodwinds are made of wood 
ñor do they produce sound in the same way. For example, a flute's sound is 
generated by blowing across a hole in a tube, whereas other woodwinds produce 
sound by causing a reed to vibrate the air within a tube. 

Opening or covering finger holes along the sides of the instrument Controls the 
pitch of a woodwind by changing the length of the tube and, therefore, the 
length of the vibrating air column. It's a common misunderstanding that 
the natural sound of a woodwind instrument radiates entirely from its bell or 
mouthpiece. In reality, a large part of its sound often emanates from the finger 
holes that span the instrument's entire length. 

CLARINET 

The clarinet commonly comes in two pitches: the B clarinet, with a lower limit 
of D3 (147 Hz), and the A clarinet, with a lower limit of C3 (139 Hz). The 
highest fundamental is around G6 (1570 Hz), whereas notes an octave above 
middle C contain frequencies of up to 150 Hz when played soffly. This spectrum 
can range upward to 12 kHz when played loudly. The sound of this reeded 
woodwind radiates almost exclusively from the finger holes at frequencies 
between 800 Hz and 3 kHz; however, as the pitch rises, more of the sound 
emanates from the bell. Often, the best mic placement occurs when the 
pickup is aimed toward the lower finger holes at a distance of 6 inches to 1 foot 
(Figure 4.61). 
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FIGURE 4.61 

Typical microphone 
position for the 
clarinet. 


FLUTE 

The flute's fundamental range extends from B3 to about C7 (247 
to 2093 Hz). For médium loud tones, the upper overtone limit 
ranges between 3 and 6 kHz. Commonly, the instrument's sound 
radiates along the player's line of sight for frequencies up to 3 kHz. 
Above this frequency, however, the radiated direction often moves 
outward 90° to the players right. When miking a flute, placement 
depends on the type of music being played and the room's overall 
acoustics. When recording classical flute, the mic can be placed 
on-axis and slightly above the player at a distance of between 3 
and 8 feet. When dealing with modern musical styles, the distance 
often ranges from 6 inches to 2 feet. In both circumstances, the 
microphone should be positioned at a point 1/3 to 1/2 the dis¬ 
tance from the instrument's mouthpiece to its footpiece. In this 
way, the instrument's overall sound and tone quality can be picked 
up with equal intensity (Figure 4.62). Placing the mic directly in 
front of the mouthpiece will increase the level (thereby reducing 
feedback and leakage); however, the full overall body sound won't 
be picked up and breath noise will be accentuated. If mobility is 
important, an integrated contact pickup can be used or a clip mic 
can be secured near the instrument's mouthpiece. 

SAXOPHONE 



FIGURE 4.62 

Typical microphone 

position for the flute. HARMONICA 


Saxophones vary greatly in size and shape. The 
most popular sax for rock and jazz is the S-curved 
B-flat tenor sax, whose fundamentáis span from 
B2 to F5 (117 to 725 Hz), and the E-flat alto, 
which spans from C3 to G5 (140 to 784 Hz). 
Also within this family are the straight-tubed 
soprano and sopranino, as well as the S-shaped 
baritone and bass saxophones. The harmonic 
content of these instruments ranges up to 8 kHz 
and can be extended by breath noises up to 
13 kHz. As with other woodwinds, the mic 
should be placed roughly in the middle of the 
instrument at the desired distance and pointed 
slightly toward the bell (Figure 4.63). Keypad 
noises are considered to be a part of the instru¬ 
ment's sound; however, even these can be 
reduced or eliminated by aiming the micro¬ 
phone closer to the bell's outer rim. 


Harmónicas come in all shapes, sizes and keys ... and are divided into two basic 
types: the diatonic and the chromatic. Their pitch is determined purely by the 
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FIGURE 4.63 

Typical microphone positions for the saxophone: (a) standard placement; (b) typical “clip-on” placement. 


length, width and thickness of the various vibrating metal reeds. The "harp" 
players habit of forming his or her hands around the instrument is a way to 
mold the tone by forming a resonant cavity. The tone can be deepened and a 
special "wahing" effect can be produced by opening and closing a cavity that's 
formed by the palms; consequently, many harmónica players carry their pre- 
ferred microphones with them (Figure 4.64) rather than being stuck in front of 
an unfamiliar mic and stand. 

MICROPHONE SELECTION 

The following information is meant to provide insights into a limited number 
of professional mies that are used for music recording and professional 
sound applications. This list is by no means complete, as literally hundreds of 
mies are available, each with its own particular design, sonic character and 
application. 

Shure SM57 

The SM57 (Figure 4.65) is widely used by engineers, artists, touring sound 
companies, etc., for instrumental and remóte recording applications. The SM57's 
midrange presence peak and good low-frequency response make it useful for 
use with vocals, snare drums, toms, kick drums, electric guitars and keyboards. 
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FIGURE 4.64 

The Shure 520DX “Green Bullet” microphone, a preferred harmónica pickup for many musicians. (Courtesy of Shure 
Brothers, Inc., www.shure.com.) 


Specifications: 

■ Transducer type: moving-coil dynamic 

■ Polar response: cardioid 

■ Frequency response: 40 to 15,000 Hz 

■ Equivalent noise rating: -7.75 dB (0 dB = 1 V/microbar). 

AKG D112 

Large-diaphragm cardioid dynamic mies, such as the AKG DI 12 (Figure 4.66), 
are often used for picking up kick drums, bass guitar cabinets, and other low- 
frequeney, high-output sources. 

Specifications 

■ Transducer type : moving-coil dynamic 

■ Polar response: cardioid 

■ Frequency response: 30 to 17,000 Hz 

■ Sensitivity: -54 dB ± 3 dB re. 1 V/microbar. 


Beyerdynamic M160 

The Beyer M160 ribbon microphone (Figure 4.67) is capable of handling high 
sound-pressure levels without sustaining damage while providing the transpar- 
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FIGURE 4.65 

Shure SM57 dynamic microphone. (Courtesy of Shure Brothers, Inc., www.shure.com.) 


ency that often is inherent in ribbon mies. Its hypercardioid response yields a 
wide-frequeney response/low-feedback characteristic for both studio and stage. 

Specifications 


■ Transducer type: ribbon dynamic 

■ Polar response: hypercardioid 

■ Frequency response: 40 to 18,000 Hz 

■ Sensitivity: 52 dB (0 dB = 1 mW/Pa) 

■ Equivalent noise rating: -145 dB 

■ Output impedance: 200 Q. 
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FIGURE 4.66 

AKG D112 dynamic microphone. (Courtesy of AKG Acoustics, Inc., www.akg.com.) 


Royer Labs R-121 

The R-121 is a ribbon mic with a figure-8 pattern (Figure 4.68). Its sensitivity is 
roughly equal to that of a good dynamic mic, and it exhibits a warm, realistic 
tone and fíat frequency response. Made using advanced materials and cutting- 
edge construction techniques, its response is fíat and well balanced; the low end 
is deep and full without getting boomy, mids are well defined and realistic, and 
the high-end response is sweet and natural sounding. 

Specifications 

■ Acoustic operating principie: electrodynamic pressure gradient 

■ Polar pattern: figure 8 

■ Generating element: 2.5-micron aluminum ribbon 

■ Frequency response: 30 to 15,000 Hz ± 3 dB 

■ Sensitivity: -54 dBV re. 1 V/Pa ± 1 dB 

■ Output impedance: 300 Q at 1 K (nominal); 200 Q optional 

■ Máximum SPL: >135 dB. 
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FIGURE 4.67 

Beyerdynamic MI 60 ribbon microphone. (Courtesy of Beyerdynamic, www.beyerdynamic.com.) 


Neumarm KM 180 Series 

The 180 Series consists of three compact miniature microphones (Figure 4.69): 
the KM 183 omnidirectional and KM 185 hypercardioid microphones and the 
successful KM 184 cardioid microphone. All 180 Series microphones are avail- 
able with either a matte black or nickel finish and come in a folding box with 
a windshield and two stand mounts that permit connection to the microphone 
body or the XLR-connector. 

Specifications 

■ Transducer type: condenser 

■ Polar response: cardioid (183), cardioid (184) and hypercardioid (185) 

■ Frequency response: 20 to 20 kHz 

■ Sensitivity: 12/15/10 mV/Pa 

■ Output impedance: 50 Q 

■ Equivalent noise level: 16/16/18 dB -A. 
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FIGURE 4.68 

Royer Labs R-121 ribbon microphone. (Courtesy of Royer Labs, www.royerlabs.com.) 


AKGC3000B 

The AKG C3000B (Figure 4.70) is a low-cost, large-diaphragm condenser mic. 
Its design incorporales a bass roll-off switch, a -10-dB pad and a highly effective 
internal windscreen. The mic's dual-diaphragm capsule design is floated in an 
elastic suspensión for improved rejection of mechanical noise. 

Specifications 

■ Transducer type: condenser 

■ Polar response: cardioid 

■ Frequency response: 20 to 20,000 Hz 

■ Sensitivity: 25 mV/Pa (-32 dBv). 




























































































Microphones: Design and Application iAPI R 









1 kHi 


• MMIHMIIMMIIN 16 kHi 


FIGURE 4.69 

Neumann KM 180 Series condenser microphones. (Courtesy of Georg Neumann GMBH, www.neumann.com.) 


MXL V67i 

The MXL V67i's design (Figure 4.71) ineludes two selectable diaphragms, with 
the front side having a warm sound, and the mic's back side being used to 
produce a brighter more airy sound. A bright red LED shines through the grill 
indicating which capsule is energized. 

Specifications 

■ Type: selectable capsule condenser microphone 

■ Frequency range: 30 to 20 kHz 

■ Polar pattern: cardioid 

■ Sensitivity: 15 mV/Pa 

■ Impedance: 200 W 

■ Signal-to-noise ratio: 74 dB (ref. 1 Pa A-weighted) 

■ Equivalent noise level: 20 dB (A-weighted IEC 268-4) 

■ Max SPL for 0.5% THD: 140 dB 
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FIGURE 4.70 

The AKG C3000 B condenser microphone. (Courtesy of AKG Acoustics, Inc., www.akg.com.) 


FIGURE 4.71 

The Marshal MXL 
V67icondenser 
microphone. 
(Courtesy of Marshall 
Electronics, Inc., 
www.mxlmics.com.) 

































































































































Microphones: Design and Application 1APT R 



FIGURE 4.72 

Telefunken M216 
stereo microphone. 
(Courtesy of 
Telefunken USA, 
www.telefunkenusa. 
com.) 


■ Power requirements: Phantom power 48 V ± 4 V 

■ Current consumption: <3.0 mA. 

Telefunken M216 stereo mic 

The Telefunken M216 matrix design (Figure 4.72) features a matched pair of 
1-inch dual-sided capsules, a new, old-stock NOS ECC81 vacuum tube, matched 
custom-wound transformers, and a custom power supply with stereo matrix and 
Z40 encoding/decoding settings. Each channel offers nine different polar pat- 
terns. Cardioid, omni and figure-8 capabilities and numerous in-between pat- 
terns result in an extensive range of stereo imaging possibilities. 

Specifications 

■ Type: selectable pattern stereo condenser microphone 

■ Frequency range: 20 to 20 kHz 

■ Polar pattern: cardioid, omni, figure-8 and six in-between patterns 

■ Sensitivity: 14 mV/Pa 

■ Impedance: 200 W 

■ Signal-to-noise ratio: 125 dB 

■ Equivalent noise level: 20 dB 

■ Power requirements: Ela M916 with quadraphonic outputs. 











This page intentionally left blank 



CHAPTER 5 


The Analog Tape 
Recorder 


From its inception in Germany in the late 1920s and its American introduction 
by Jack Mullin in 1945 (Figure 5.1), the analog tape recorder (or ATR) had steadily 
increased in quality and universal acceptance to the point that professional and 
personal studios had totally relied upon magnetic media for the storage of 
analog sound onto reels of tape. With the dawning of the project studio and 
computer-based DAWs, the use of two-channel and multitrack ATRs has steadily 
dwindled to the point where no new analog tape machine models are currently 
being manufactured. In short, recording to analog tape has steadily become a 
high-cost, future-retro, "specialty" process for getting a certain sound. This being 
said, the analog recording process is still highly regarded and even sought after 
by many studios as a special sonic tool ... and by others as a raised fist against 
the onslaught of the "evil digital empire." Without delving into the ongoing 
debate of the merits of analog versus digital, I think it's fair to say that each has 
its own distinct type of sound and application in audio and music production. 
Although professional analog recorders are usually much more expensive than 
their digital counterparts, as a general rule, a properly aligned, professional 
analog deck will have a particular sound that's often described as being fu.ll, 
punchy, gutsy and "raw" (when used on drums, vocals, entire mixes or anything 
that you want to throw at it). In fact, the limitations of tape are often used as 
a form of "artistic expression." From this, it's easy to see and hear why the analog 
tape recorder isn't dead yet ... and probably won't be for some time. 

TO 2-INCH OR NOT TO 2-INCH? 

Before we delve into the inner workings of the analog tape recorder, let's take 
a moment to discuss ways in which the analog tape sound can be taken advan- 
tage of in the digital and project studio environment. Before you go out and 
buy your own deck, however, there are other cost-effective ways to get "that 
sound" on your own projects. For example: 

■ Make use of plug-ins that can emulate (or approximate) the overdriven 
sound of an analog tape track. 

■ Rent a studio that has an analog multitrack for a few hours or days. You 
could record specific tracks to tape, transfer existing digital tracks to tape 



Magnetic Recording and Its Media 


FIGURE 5.1 

John T. (Jack) Mullin 
(on the left) proudly 
displaying his two 
WWII vintage Germán 
Magnetophones, 
which were the first 
two tape-based 
recorders in the 
United States. 
(Courtesy of John T. 
Mullin.) 



or dump an entire final mixdown to tape. For the cost of studio time and 
a reel of tape, you could inject your project with an entirely new type of 
sound (you might consider buying a single reel of multitrack tape that 
can be erased and reused once the takes have been transferred to disk). 

■ Rent an analog machine from a local studio equipment Service. For a 
rental fee and basic cartage charges, you could reap the benefits of having 
an analog ATR for the duration of a project, without any undue financial 
and maintenance overhead. 

A few guidelines should also be kept in mind when recording and/or transfer- 
ring tracks to or from a multitrack recorder: 

■ Obviously, high recording levels add to that sought-after "overdriven" 
analog sound; however, driving a track too hard (hot) can actually kill 
a track's definition or "air." The trick is often to find a center balance 
between the right amount of saturation and downright distortion. 

■ Noise reduction can be a good thing, but it can also diminish what is 
thought of as that "classic analog sound." Newer, wide tape width record¬ 
ers (such as ATR Services' ATR-102 1-inch, two-track and the 108C 2-inch, 
eight-track recorder), as well as older 2-inch, 16-track recorders, can 
provide improved definition without the need for noise reduction. 

MAGNETIC RECORDING AND ITS MEDIA 

At a basic level, an analog audio tape recorder can be thought of as a sound 
recording device that has the capacity to store audio information onto a mag- 
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netizable tape-based médium and then play this information back at a later 
time. By definition, analog refers to something that's "analogous," similar to or 
comparable to something else. An ATR is able to transform an electrical input 
signal directly into a corresponding magnetic energy that can be stored onto 
tape in the form of magnetic remnants. lipón playback, this magnetic energy 
is then reconverted back into a corresponding electrical signal that can be ampli- 
fied, mixed, processed and heard. 

The recording media itself is composed of several layers of material, each serving 
a specific function (Figure 5.2). The base material that makes up most of a tape's 
thickness is often composed of polyester or polyvinyl chloride (PVC), which is 
a durable polymer that's physically strong and can withstand a great deal of 
abuse before being damaged. Bonded to the PVC base is the all-important layer 
of magnetic oxide. The molecules of this oxide combine to create some of the 
smallest known permanent magnets, which are called domains (Figure 5.3a). On 
an unmagnetized tape, the polarities of these domains are randomly oriented 
over the entire surface of the tape. The resulting energy forcé of this random 
magnetization at the reproduce head is a general cancellation of the combined 
domain energies, resulting in no signal at the recorder's output (except for the 
tape noise that occurs due to the residual domain energy output). 

When a signal is recorded, the magnetization from the record head polarizes 
the individual domains (at varying degrees in positive and negative angular 
directions) in such a way that their average magnetism produces a much larger 
combined magnetic flux (Figure 5.3b). When the tape is pulled across the play¬ 
back head at the same, constant speed at which it was recorded, this alternating 
magnetic output is then converted back into an alternating signal that can then 
be amplified and further processed for reproduction. 



Top Coating 
Magnetic Oxide 
Polyester Base (PVC) 
Antistatic Backing 


FIGURE 5.2 

Structural layers of 
magnetic tape. 
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FIGURE 5.3 

Orientation of 
magnetic domains on 
unmagnetized and 
magnetized recording 
tape: (a) The random 
orientation of an 
unmagnetized tape 
results in no output. 
(b) Magnetized 
domains result in an 
average flux output 
at the magnetic 
head. 
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FIGURE 5.4 

Otari MX-5050 B3 two-channel recorder. (Courtesy of 
Otari Corporation, www.otari.com.) 


THE PROFESSIONAL 
ANALOG ATR 

Professional analog ATRs can be found in 2-, 4-, 
8-, 16- and 24-track formats. Each configuration 
is generally best suited to a specific production 
and postproduction task. For example, a 2-track 
ATR is generally used to record the final stereo 
mix of a project (Figures 5.4 and 5.5), whereas 
8-, 16- and 24-track machines are obviously used 
for multitrack recording (Figures 5.6 and 5.7). 
Although no professional analog machines are 
currently being manufactured, quite a few decks 
can be found on the used market in varying 
degrees of working condition. Certain recorders 
(such as the ATR-108C 2-inch, multitrack/mas- 
tering recorder) can be switched between tape 
width and track formats, allowing the machine 
to be converted to handle a range of multitrack, 
mixdown and mastering tasks. 

THE TAPE TRANSPORT 

The process of recording audio onto magnetic 
tape depends on the transport's capability to 
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FIGURE 5.5 

ATR-102 1-inch 
stereo mastering 
recorder. (Courtesy 
of AIR Service 
Company, www. 
atrservice.com.) 



FIGURE 5.6 

ATR-108C 2-inch multitrack/mastering recorder. 

(Courtesy of ATR Service Company, www.atrservice.com.) 



FIGURE 5.7 

Studer A827 analog multitrack recorder with autolocator. 
(Courtesy of Studer North America, www.studer.ch.) 






























































The Tape Transport 


tape direction 



FIGURE 5.8 

Relationship of time 
to the physical length 
of recording tape. 
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pass the tape across a head path at a constant speed and with a uniform tensión. 
In simpler words, a recorder must uniformly pass a precise length of tape 
over the record head within a specific time period (Figure 5.8). During playback, 
this relationship is maintained by again moving the tape across the heads at 
the same speed, thereby preserving the program's original pitch, rhythm and 
duration. 

This constant speed and tensión movement of the tape across a head's path is 
initiated by simply pressing the Play button. The drive can be disengaged at any 
time by pressing the Stop button, which applies a simultaneous breaking forcé 
to both the left and right reels. The Fast Forward and Rewind buttons cause the 
tape to rapidly shuttle in the respective directions in order to lócate a specific 
point. Initiating either of these modes engages the tape lifters, which raise the 
tape away from the heads (definitely an ear-saving feature). Once the play mode 
has been engaged, pressing the Record button allows audio to be recorded onto 
any selected track or tracks. 

Beyond these basic Controls, you might expect to run into several differences 
between transports (often dependingon the machines age). For example, older 
recorders might require that both the Record and Play buttons be simultane- 
ously pressed in order to go into record mode; while others may begin record¬ 
ing when the Record button is pressed while already in the Play mode. 

On certain older professional transports (particularly those wonderful Ampex 
decks from the 1950s and 1960s), stopping a fast-moving tape by simply press¬ 
ing the Stop button can stretch or destroy a master tape, because the inertia is 
simply too much for the mechanical brake to deal with. In such a situation, a 
procedure known as "rocking" the tape is used to prevent tape damage. The 
deck can be rocked to its stop position by engaging the fast-wind mode in the 
direction opposite the current travel direction until the tape slows down to a 
reasonable speed ... at which point it's safe to press the Stop button. 

In recent decades, tape transport designs have incorporated total transport logic 
(TTL), which places transport and monitor functions under microprocessor 
control. This has a number of distinct advantages in that you can push the Play 
or Stop buttons while the tape is in fast-wind mode without fear of tape damage. 
With TTL, the recorder can sense the tape speed and direction and then auto- 
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matically rock the transport until the tape can safely be stopped or can slow the 
tape to a point where the deck can seamlessly slip into play or record mode. 


Most modern ATRs are equipped with a shuttle control that enables the tape to 
be shuttled at various wind speeds in either direction. This allows a specific cue 
point to be located by listening to the tape at varying play speeds, or the control 
can be used to gently and evenly wind the tape onto its reel at a slower speed 
for long-term storage. The Edit button (which can be found on certain profes- 
sional machines) often has two operating modes: stop-edit and dump-edit. If 
the Edit button is pressed while the transport is in the stop mode, the left and 
right tape reel brakes are released and the tape sensor is bypassed. This makes 
it possible for the tape to be manually rocked back and forth until the edit point 
is found. Often, if the Edit button is pressed while in the play mode, the take-up 
turntable is disengaged and the tape sensor is bypassed. This allows unwanted 
sections of tape to be spooled off the machine (and into the trash can) while 
listening to the material as it's being dumped during playback. 

A safety switch, which is incorporated into all professional transports, initiates 
the stop mode when it senses the absence of tape along its guide path; thus, 
the recorder stops automatically at the end of a reel or should the tape acciden- 
tally break. This switch can be built into the tape-tension sensor, or it might 
exist in the form of a light beam that's interrupted when tape is present. 


tape motion 


S S * 


V;; - 


pote pie ce 


Most professional ATRs are equipped with automatic tape counters that accu- 
rately read out time in hours, minutes, seconds and sometimes frames 
(00:00:00:00). Many of these recorders have digital readout displays that double 
as tape-speed indicators when in the "vari-speed" 
mode. This function incorporates a control that 
lets you vary the tape speed from fixed industry 
standards. On many tape transports, this control 
can be continuously varied over a ±20% range 
from the iy 2 , 15 or 30 ips (inches per second) 
standard. 


THE MAGNETIC TAPE HEAD 

Most professional analog recorders use three 
magnetic tape heads, each of which performs a 
specialized task: 

■ Record 

■ Reproduce 

■ Erase. 

The function of a record head (Figure 5.9) is to 
electromagnetically transform analog electrical 
signáis into corresponding magnetic fields that 
can be permanently stored onto magnetic tape. 
In short, the input current flows through coils of 


FIGURE 5.9 

The record head. 
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The Magnetic Tape Head 


FIGURE 5.10 

The playback head. 


wire that are wrapped around the head's magnetic pole pieces. Since the theory 
of magnetic induction States that "whenever a current is injected into metal, a 
magnetic field is created within that metal" ... a magnetic forcé is caused to 
flow through the coil, into the pole pieces and across the head gap. Like electric- 
ity, magnetism flows more easily through some media than through others. The 
head gap between poles creates a break in the magnetic field, thereby creating 
a physical resistance to the magnetic "circuit." Since the gap is in physical 
contact with the moving magnetic tape, the tape's magnetic oxide offers a lower 
resistance path to the field than does the nonmagnetic gap. Thus, the flux path 
travels from one pole piece, into the tape and to the other pole. Since the mag¬ 
netic domains retain their polarity and magnetic intensity as the tape passes 
across the gap, the tape now has an analogous magnetic "memory" of the 
recorded event. 

The reproduce or playback head (Figure 5.10) operates in a way that's opposite 
to the record head. When a recorded tape track passes across the reproduce head 
gap, a magnetic flux is induced into the pole pieces. Since the theory of magnetic 
induction also States "whenever a magnetic field cuts across metal, a current 
will be set up within that metal" ... an alternating current is caused to flow 
through the pickup coil windings, which can then be amplified and processed 
into a larger output signal. 

Note that the reproduce head's output is nonlinear because this signal is pro- 
portional to both the tape's average flux magnitude and the rate of change of 
this magnetic field. This means that the rate of change increases as a direct fuñe- 
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tion of the recorded signal's frequency. Thus, the output level of a playback head 
effectively doubles for each doubling in frequency, resulting in a 6-dB increase 
in output voltage for each increased octave. The tape speed and head gap width 
work together to determine the reproduce head's upper-frequency limit, which 
in turn determines the system's overall bandwidth. The wavelength of a signal 
that's recorded onto tape is equal to the speed at which tape travels past the 
reproduce head, divided by the frequency of the signal; therefore, the faster the 
tape speed, the higher the upper-frequency limit. Similarly, the smaller the head 
gap, the higher the upper-frequency limit. 

The function of the erase head is to effectively reduce the average magnetization 
level of a recorded tape track to zero, thereby allowing the tape track to be re- 
recorded. After a track is placed into the record mode, a high-frequency and 
high-intensity sine-wave signal is fed into the erase head (resulting in a tape 
that's being saturated in both the positive- and negative-polarity directions). 
This alternating saturation occurs at such a high speed that it serves to confuse 
any magnetic pattern that existed on the tape. As the tape moves away from the 
erase head, the intensity of the magnetic field decreases, leaving the domains 
in a random orientation, with a resulting average magnetization or output level 
that's as cióse to zero as tape noise will allow. 

EQUALIZATION 

Equalization (EQ) is the term used to denote an intentional change in relative 
amplitudes at different frequencies. Because the analog recording process isn't 
linear, equalization is needed to achieve a fíat frequency-response curve when 
using magnetic tape. The 6-dB-per-octave boost that's inherent in the playback 
head's response curve requires that a complementary equalization cut of 6 dB 
per octave be applied within the playback circuit (see Figure 5.11). 

BIAS CURRENT 

In addition to the nonlinear changes that occur in playback level relative to 
frequency, another discrepancy in the recording process exists between the 
amount of magnetic energy that's applied to the record head and the amount 
of magnetism that's retained by the tape after the initial recording field has been 
removed. As Figure 5.12a shows, the magnetization curve of tape is linear 
between points A and B, as well as between points C and D. Signáis greater than 
A and D have reached the saturation level and are subject to clipping distortion. 
Signáis falling within the B to C range are too low in flux level to adequately 
magnetize the domains during the recording process. For this reason, it's impor- 
tant that low-level signáis be boosted so that they're pushed into the linear 
range. This boost is applied by mixing an AC bias current (Figure 5.12b) with 
the audio signal. This bias current is applied by mixing the incoming audio 
signal with an ultrasonic sine-wave signal (often between 75 and 150 kHz). The 
combined signáis are amplitude modulated in such a way that the overall mag¬ 
netic flux levels are given an extra "oomph," which effectively boosts the signal 
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FIGURE 5.11 

A fíat frequency 
playback curve 
results due to 
complementary 
equalization in the 
playback Circuit. 
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above the nonlinear zero-crossover range and into the linear portion of the 
curve. In fact, if this bias signal weren't added, distortion levels would be so 
high as to render the recording process useless. 


MONITOR NG MODES 

The output signal of a professional ATR channel can be switched between three 
working modes: 


■ Input 

■ Reproduce 

■ Sync. 


In the input (source) mode, the signal at the selected channel output is derived 
from its input signal. Thus, with the ATR transport in any mode (including 
stop), it's possible to meter and monitor a signal that's present at a channel's 
selected input. In the reproduce mode, the output and metering signal is derived 
from the playback head. This mode can be useful in two ways: It allows previ- 
ously recorded tapes to be played back, and it enables the monitoring of mate¬ 
rial ofif of the tape while in the record mode. The latter provides an immediate 
quality check of the ATR's entire record and reproduce process. The sync mode 
(originally known as selective synchronization, or sel-sync) is a required feature 
in analog multitrack ATRs because of the need to record new material on one 
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FIGURE 5.12 

The effects of bias 
current on recorded 
linearity: (a) 
magnetization curve 
showing distortion at 
lower levels; (b) after 
bias, when the signal 
is boosted back into 
the curve’s linear 
regions. 


or more tracks while simultaneously monitoring tracks that have been previ- 
ously recorded (during a process called overdubbing). Here's the deal ... using 
the record head to lay down one or more tracks while listening to previously 
recorded tracks through the reproduce head would actually cause the newly 
recorded track(s) to be out of sync with the others on final playback (due to 
the physical distance between the two, as shown in Figure 5.13a). To prevent 
such a time lag, all of the reproduced tracks must be monitored off of the record 
head at the same time that record tracks are being laid down onto the same 
head. Since the record head is used for both recording and playback, there is 
no physical time lag and, thus, no signal delay (Figure 5.13b). 




















































































To Punch or Not to Punch 
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FIGURE 5.13 

The sync mode’s 
function. (a) In the 
monitored playback 
mode, the recorded 
signal lags behind 
the recorded signal, 
thereby creating an 
out-of-sync condi- 
tion. (b) In the sync 
mode, the record 
head acts as both 
record and playback 
head, bringing the 
signáis into sync. 
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TO PUNCH OR NOT TO PUNCH 

You've all heard the age-old adage ... "$%& happens." Well, it happens in the 
studio—a lot! Whenever a mistake or bad line occurs during a multitrack 
session, it's often (but not always) possible to punch-in on a specific track or set 
of tracks. Instead of going back and re-recording an entire song or overdub, 
performing a punch involves going back and re-recording over a specific section 
in order to fix a bad note, musical line—you ñame it. This process is done by 
cueing the tape at a logical point before the bad section and then pressing Play. 
Just before the section to be fixed, pressing the Record button (or entering record 
under automation) will place the track into record mode. At the section's end, 
pressing the Play button again causes the track to fall back out of record, thereby 
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preserving the section following the punch. From a monitor standpoint, the 
recorder begins playback in the sync mode; once placed into record, the track 
switches to monitor the input source. This lets the performers hear themselves 
during the punch while listening to playback both before and after the take. 

When performing a punch, it's often far better to "fix" the track immediately 
after the take has been recorded, while the levels, mic positions and perfor¬ 
mance vibe are still the same. This also makes it easier to go back and re-record 
the entire song or a larger section should the punch not work. If the punch can't 
be performed at that time, however, it's generally a good idea to take detailed 
notes about mic selection, placement, preamps and so on to recreate the ses- 
sion's setup without having to guess the details from memory. 

As every experienced engineer/producer knows, performing a punch can be 
tricky. In certain situations, it's a complete no-brainer ... for example, when a 
stretch of silence the size of a Mack truck exists both before and after the bad 
section, you'll have plenty of room to punch in and out. At other times, a punch 
can be very tight or problematic (e.g., if there's very little time to punch in or 
out, when trying to keep vocal lines fluid and in-context, when it's hard to feel 
the beat of a song or if it has a fast rhythm). In short, punching-in shouldn't 
be taken too lightly ... ñor so seriously that you're afraid of the process. Talk it 
over with the producer and/or musicians. Is this an easy punch? Does the 
section really need fixing? Do we have the time right now? Or, is it better just 
to redo the song? In short, the process is totally situational and requires atten- 
tion, skill, experience and sometimes a great deal of luck. 

■ Before committing the punch to tape, it's often a wise idea to rehearse the 
punch, without actually committing the fix to tape. This has the advantage 
of giving both you and the performer a chance to practice beforehand. 

■ Some analog decks (and most DAWs) will let you enter the punch-in and 
punch-out times under automation, thereby allowing the punch to be 
surgically performed automatically. 

■ If you're recording onto the same track, a fudged punch may leave you 
with few options other than to re-record the entire song or section of a 
song. An alternative to this dilemma would be to record the fix into a 
sepárate track and then switch between tracks in mixdown. 

■ The track(s) could be transferred to a DAW, where the edits could be per¬ 
formed in the digital domain. 


TAPE, TAPE SPEED AND HEAD CONFIGURATIONS 

Professional analog ATRs are currently available in a wide range of track- and 
tape-width configurations. The most common analog configurations are 2-track 
mastering machines that use tape widths of 1/4 inch, 1/2 inch, and even 1 inch, 
as well as 16- and 24-track machines that use 2-inch tape. Figure 5.14 
details many of the tape formats that can be currently found. Optimal tape-to- 
head performance characteristics for an analog ATR are determined by several 
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FIGURE 5.14 

Analog track 
configurations for 
various tape widths. 
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parameters: track width, head-gap width and tape speed. In general, track 
widths are on the order of 0.080 inch for a 1/4-inch 2-track ATR; 0.070 inch 
for 1/2-inch 4-track, 1-inch 8-track, and 2-inch 16-track formats or 0.037 inch 
for the 2-inch 24-track format. As you might expect, the greater the recorded 
track width, the greater the amount of magnetism that can be retained by the 
magnetic tape, resulting in a higher output signal and an improved signal-to- 
noise ratio. The use of wider track widths also makes the recorded track less 
susceptible to signal-level dropouts. 

The most common tape speeds used in audio production are 15 ips (38 cm/ 
sec) and 30 ips (76 cm/sec). Although 15 ips will eat up less tape, 30 ips has 
gained wide acceptance in recent years for having its own characteristic sound 
(often having a tighter bottom end), as well as a higher output and lower noise 
figures (which in certain cases eliminate the need for noise reduction). On the 
other hand, 15 ips has a reputation for having a "gutsy," rugged sound. 

PRINT-THROUGH 

A form of deterioration in a recording's quality, known as print-through, begins 
to occur after a recording has been made. This effect is the result of the transfer 
of a recorded signal from one layer of tape to an adjacent track layer by means 
of magnetic induction, which gives rise to an audible false signal or pre-echo 
on playback. The effects of print-through are greatest when recording levels are 
very high, and the effect decreases by about 2 dB for every 1-dB reduction in 
signal level. The extent of this condition also depends on such factors as length 
of storage, storage temperature and tape thickness (tapes with a thicker base 
material are less likely to have severe print-through problems). 
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FIGURE 5.15 

Recorded analog 
tapes should always 
be stored in the 
tails-out position. 


Because of the effects of print-through, the standard method of professionally 
storing a recorded analog tape is in the tails-out position. Remember: 

■ Professional analog tape should always be stored tails-out (on the right- 
hand take-up reel). 

■ lipón playback, the tape should be wound onto the left-most "supply 
reel." 

■ During playback, feed the tape back onto the right-hand take-up reel, after 
which time it can again be removed for storage. 

■ If the tape has been continuously wound and rewound during the session, 
it's often wise to rewind the tape and then smoothly play or slow-wind 
the tape onto the take-up reel, after which time it can be removed for 
storage. 

So why do we go through all this trouble? When a tape is stored tails-out (Figure 
5.15), the print-through will bleed to the outer layers, a condition that causes 
the echo to follow the original signal in a way that's similar to the sound's 
natural decay and is subconsciously perceived by the listener as reverb instead 
of as an easily-audible pre-echo. 

CLEANLINESS 

It's very important for the magnetic recording heads and moving parts of an 
ATR transport deck to be kept free from dirt and oxide shed. Oxide shed occurs 
when friction causes small particles of magnetic oxide to flake off and accumu- 
late on surface contacts. This accumulation is most critical at the surface of the 
magnetic recording heads, since even a minute separation between the magnetic 


Degaussing 


tape and heads can cause high-frequency separation loss. For example, a signal 
that's recorded at 15 ips and has an oxide shed buildup of 1 mil (0.001 inch) 
on the playback head will be 55 dB below its standard level at 15 kHz. Dena- 
tured (isopropyl) alcohol or an appropriate cleaning solution should be used 
to clean transport tape heads and guides (with the exception of the machine's 
pinch roller and other rubber-like surfaces) at regular intervals. 


DEGAUSSING 

Magnetic tape heads are made from a magnetically soft metal, which means 
that the alloy is easily magnetized ... but once the coil's current is removed, the 
core won't retain any of its magnetism. Small amounts of residual magnetism, 
however, will build up over time, which can actually partially erase high-fre¬ 
quency signáis from a master tape. For this reason, all of the tape heads should 
be demagnetized after 10 hours of operation with a head demagnetizer. This 
handheld device works much like an erase head in that it saturates the magnetic 
head with a high-level alternating signal that randomizes residual magnetic flux. 
Once a head has been demagnetized (after 5 to 10 seconds), it's important to 
move the tool to a safe distance from the tape heads at a speed of less than 2 
inches per second before turning it off, so as to avoid inducing a larger magnetic 
flux back into the head. Before an ATR is aligned, the magnetic tape heads 
should always be cleaned and demagnetized in order to obtain accurate read- 
ings and to protect expensive alignment tapes. 


BACKUP AND ARCHIVE STRATEGIES 

In this day of hard drives, CDs and digital data, we've all come to know the 
importance of backing up our data. With important music and media projects, 
it's equally important to create a tape backup copy in case of an unforeseen 
catastrophe or as added insurance that future generations can enjoy the fruits 
of your work. 

Backing up your project 

The one basic truth that can be said about analog magnetic tape is that this 
médium has withstood the test of time. With care and reconditioning, tapes 
that have been recorded in the 1940s have been fully restored, allowing us to 
preserve and enjoy some of the best music of the day. On the other hand, digital 
data has two points that aren't exactly its favor: 

■ Data that resides on hard drives isn't the most robust of media over time. 
Even CDRs (which are rated to last over 100 years) haven't really been 
proven to last. 

■ Even if the data remains intact, with the ever-increasing advances in Com¬ 
puter technology, who's to say that the media, drives, programs, session 
formáis and file formats will be around in 10 years, let alone 50! 
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These warnings aren't slams against digital, just precautions against the march 
of technology versus the need for media preservation. For the above reasons, 
media preservation is a top priority for such groups as the Recording Academy's 
Producers & Engineers Wing (P&E Wing), as well as for many major record 
labels—so much so that many stipulate in their contracts that multitrack ses- 
sions (no matter what the original médium) are to be transferred and archived 
to 2-inch multitrack analog tape. 

When transferring digital tracks to an analog machine, it's always wise to make 
sure that the recorder has been properly calibrated and that reference tones 
(1 kHz, 10 kHz, 16 kHz and 100 Hz) have been recorded at the beginning of 
the tape. When copying from analog to analog, both machines should be prop¬ 
erly calibrated, but the source for the newly recorded tones should be the master 
tape. If a SMPTE track is required, be sure to stripe the copy with a clean, jam- 
sync code. The backing up of analog tapes and/or digital data usually isn't a big 
problem ... unless you've lost your original masters. In this situation, a proper 
safety master can be the difference between panic and peace. 

Archive strategies 

Just as it's important to back up your media, it's also important that both 
the original and backup media be treated and stored properly. Here are a few 
guidelines: 

■ As stated earlier, always store the tapes tails-out. 

■ Wind the tapes onto the take-up storage reel at slow-wind or play speeds. 

■ Store the boxes vertically. If they're stored horizontally, the outer edges 
could get bent and become damaged. 

■ Media storage facilities exist that can store your masters or backups for a 
fee. If this isn't an option, store them is an area that's cool and dry (e.g., 
no temperature extremes, in low humidity, no attics or basements). 

■ Store your masters and backups in sepárate locations. In case of a fire or 
other disaster ... one would be lost, but not both (always a good idea 
with digital data, as well). 
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CHAPTER 6 


Digital Audio Technology 


It absolutely goes without saying that digital audio technology has changed the 
way in which all forms of media are produced, gathered and distributed. As a 
driving forcé behind human creativity, expression and connectivity, the impact 
of digital media production is simply HUGE, and is an integral part of both the 
médium and the message within modern-day communication. 

In recent years, the way in which music producers and engineers record, manip¬ 
úlate and distribute digital audio has changed dramatically. As with most other 
media, these changes have been brought about by the integration of the per¬ 
sonal Computer into the modern-day project studio environment. Newer gen- 
erations of computers and related hardware peripherals have been integrated 
into both the pro and project studio to record, fold, spindle and mutilate audio 
with astonishing ease. This chapter is dedicated to the various digital system 
types and their relation to the modern-day music production studio. 

THE LANGUAGE OF DIGITAL 

Although digital audio is a varied and complex field of study, the basic theory 
behind the magic curtains isn't all that difficult to understand. At its most ele- 
mentary level, it's simply a process by which numeric representations of analog 
signáis (in the form of voltage levels) are encoded, processed, stored and repro- 
duced over time through the use of a binary number system. 

Just as English-speaking humans communicate by combining any of 26 letters 
together into groupings known as "words" and manipúlate numbers using the 
decimal (base 10) system, the system of choice for a digital device is the binary 
(base 2) system. This numeric system provides a fast and efficient means for 
manipulating and storing digital data. By translating the alphabet, base 10 
numbers or other form of information into a binary language form, a digital 
device (such as a Computer or microprocessor) can perform calculations and 
tasks that would otherwise be cumbersome, less cost effective and/or downright 
impossible to perform in the analog domain. 

To illustrate, let's take a look at how a human construct can be translated into 
a digital language (and back). If we type the letters C, A and T into a word 
processor, the Computer will quickly go about the task of translating these 
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keystrokes into a series of 8-bit digital words represented as fO 100 0011], [0100 
0001] and [0101 0100]. On their own, these digits don't mean much, but when 
these groups are put together, they form a word that represents a four-legged 
animal that's either seldom around or always underfoot (Figure 6.1). 

In a similar manner, a digital audio system works by sampling (measuring) the 
instantaneous voltage level of an analog signal at specific intervals over time, 
and then converting these samples into a series of encoded "words" that digi- 
tally represent the analogous voltage levels. By successively measuring changes 
in an analog signal's voltage level (over time), this stream of representative 
words can be stored in a form that represents the original analog signal. Once 
stored, the data can be processed and reproduced in ways that have changed 
the face of audio production forever. 


It’s interesting to note that binary data can be encoded 
as logical “1” or “0”—“on” or “off”—States, using such 
forms of physics as: 

■ Voltage or no voltage (circuitry) 


■ Magnetic flux or no flux (tape or hard disk) 

■ Reflection off of a surface or no reflection (CD, DVD or 
other optical disc form) 

■ Electromagnetic waves (broadcast transmission). 


From this, you'll hopefully begin to get the idea that human forms of commu- 
nication (i.e., print, visual and audible media) can be translated into a digital 
form that can be easily understood and manipulated by a processor. Once the 
data has been recorded, stored and/or processed, the resulting binary data can 
be reconverted back into a form that can be easily understood by us humans 
(such as a display, readout, printed paper, lit indicator, controller interaction 
... you ñame it). If you think this process of changing one form of energy into 
an analogous form (and then back again) sounds like the general definition of 
a transducer—you're right! 

DIGITAL BASICS 

The following sections provide a basic overview of the various stages that are 
involved in the encoding of analog signáis into equivalent digital data, and the 
subsequent converting of this data back into its original analog form. 
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The encoding and decoding phases of the digitization 
process revolve around two processes: 

■ Sampling (the component of time) 

■ Quantization (the signal-level component). 

In a nutshell, sampling ¡s a process that affects the 
overall bandwidth (frequency range) that can be encoded 


within a sound file, while quantization refers to the 
resolution (overall quality and distortion characteristics) of 
an encoded signal compared to the original analog signal 
at its input. 


Sampling 

In the world of analog audio, signáis are recorded, stored and reproduced as 
changes in voltage levels that continuously vary over time in a continuous 
fashion (Figure 6.2). The digital recording process, on the other hand, doesn't 
opérate in such a manner; rather, digital recording operates by taking periodic 
samples of an analog audio waveform over time (Figure 6.3), and then calculat- 
ing each of these snapshot samples into grouped binary words that digitally 
represent these voltage levels as they change over time, as accurately as possible. 

During this process, an incoming analog signal is sampled at discrete and pre- 
cisely timed intervals (as determined by the sample rate). At each interval, this 
analog signal is momentarily "held" (frozen in time), while the converter goes 
about the process of determining what the voltage level actually is, with a degree 
of accuracy that's defined by the converter's circuitry and the chosen bit rate. 
The converter then generates a binary-encoded word that's numerically equiva- 
lent to the analog voltage level at that point in time (Figure 6.4). Once this is 
done, the converter can store the representative word into a memory médium 
(tape, disk, disc, etc.), release its hold, and then go about the task of determin¬ 
ing the valúes of the next sampled voltage. The process is then continuously 
repeated throughout the recording process. 



FIGURE 6.2 

An analog signal is 
continuous in nature. 
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FIGURE 6.3 

A digital signal 
makes use of 
periodic sampling to 
encode information. 




FIGURE 6.4 

The sampling process. (a) The analog signal is momentarily “held” (frozen in time), while the converter goes about the 
process of determining the voltage level at that point in time and then converting that level into a binary-encoded word 
that’s numerically equivalent to the original analog voltage level. (b) Once this digital information is processed and 
stored, the sample is released and the next sample is held, as the system again goes about the task of determining 
the level of the next sampled voltage ... and so forth, and so forth, and so forth over the duration of the recording. 


Within a digital audio system, the sampling rate is defined as the number of 
measurements (samples) that are periodically taken over the course of a second. 
Its reciprocal (sampling time) is the elapsed time that occurs between each 
sampling period. For example, a sample rate of 44.1 kHz corresponds to a 
sample time of 1/44,100 of a second. 

This process can be likened to a photographer who takes a series of action 
sequence shots. As the number of pictures taken in a second increases, the 
accuracy of the captured event will likewise increase until the resolution is so 
great that you can't tell that the successive, discrete pictures have turned into a 
continuous and (hopefully) compelling movie. Since the process of sampling 
is tied directly to the component of time, the sampling rate of a system deter¬ 
mines its overall bandwidth (Figure 6.5), meaning that a recording made at a 
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(b) 


FIGURE 6.5 

Discrete time sampling. (a) Whenever the sample rate is 
set too low, important data between sample periods will 
be lost. (b) As the rate is increased, more frequency- 
related data can be encoded. (c) Increasing the sampling 
frequency further can encode the recorded signal with an 
even higher bandwidth range. 


higher sample rate will be capable of storing a wider range of frequencies (effec- 
tively increasing the signal's bandwidth at its upper limit). 

Quantization 

Quantization represents the amplitude component of the digital sampling 
process. It is used to transíate the voltage levels of a continuous analog signal 
(at discrete sample points over time) into binary digits (bits) for the purpose 
of manipulating or storing audio data in the digital domain. By sampling the 
amplitude of an analog signal at precise intervals over time, the converter deter¬ 
mines the exact voltage level of the signal (during a sample interval, when the 
voltage level is momentarily held), and then outputs the signal level as an 
equivalent set of binary numbers (as a grouped word of n-bits length) which 
represent the originally sampled voltage level (Figure 6.6). The resulting word 
is used to encode the original voltage level with as high a degree of accuracy as 
can be permitted by the word's bit length and the system's overall design. 

Currently, the most common binary word length for audio is 16-bit (i.e., 
f 1111110101000001]); however, bit depths of20- and 24-bit resolution are also 
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FIGURE 6.6 

The instantaneous 
amplitude of the 
incoming analog 
signal is broken 
down into a series of 
discrete voltage 
steps, which are then 
converted into an 
equivalent binary- 
encoded word. 



in common use. In addition, computers and signal-processing devices are 
capable of performing calculations internally at the 32- and 64-bit resolution 
level. This added internal headroom at the bit level helps reduce errors in level 
and performance at low-level resolutions whenever múltiple audio datastreams 
are mixed or processed within a digital signal processing (DSP) system. This 
greater internal bit resolution is used to reduce errors that might accumulate 
within the least significant bit (LSB; the final and smallest numeric valué within 
a digital word). As múltiple signáis are mixed together and multiplied (a regular 
occurrence in gain change and processing functions), lower-bit-resolution 
numbers play a more important role in determining the signal's overall accuracy 
and distortion. Since the internal bit depth is higher, these resolutions can be 
preserved (instead of being dropped by the system's hardware or software pro¬ 
cessing functions), with a final result being an n-bit datastream that's relatively 
free of errors. This leads us to the conclusión that greater word lengths will often 
directly transíate into an increased resolution (and, within reason, arguably 
higher quality) due to the added number of finite steps into which a signal can 
be digitally encoded. The following details the number of encoding steps that 
are encountered for the most commonly used bit lengths: 


8-bit word = (nnnnnnnn) = 256 steps 
16-bit word = (nnnnnnnn nnnnnnnn) = 65,536 steps 
20-bit word = (nnnnnnnn nnnnnnnn nnnn) = 1,048,576 
steps 

24-bit word = (nnnnnnnn nnnnnnnn nnnnnnnn) 

= 16,777,216 steps 


32-bit word = (nnnnnnnn nnnnnnnn nnnnnnnn nnnnnnnn) 
= 4,294,967,296 steps 
where n = binary 0 or 1. 
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The details of the digital audio record/playback process can get quite detailed 
and complicated, but the essential basics are: 

■ Sampling (in the truest sense of the word) an analog voltage signal at 
precise intervals in time 

■ Converting these samples into a digital word valué that most accurately 
represents these voltage levels 

■ Storing them within a digital memory device. 

lipón playback, these digital words are then converted back into discrete volt- 
ages (again, at precise intervals in time), allowing the originally recorded signal 
voltages to be re-created, processed and played back. 

The devil’s in the details 

Although the basic concept behind the sample-and-hold process is relatively 
straightforward, delving further into the process can quickly bog you down in 
the language of high-level math and physics. Luckily, there are a few additional 
details relating to digital audio that can be discussed at a basic level. 

THE NYQUIST THEOREM 

This basic rule relates to the sampling process ... and States that: 


In order for the desired frequency bandwidth to be frequency to be recorded (sample rate >2 x highest 

faithfully encoded in the digital domain, the selected frequency). 

sample rate must be at least twice as high as the highest 


In plain language, should frequencies that are greater than twice the sample rate 
be allowed into the sampling process, these high frequencies would have a 
periodic nature that's shorter than the sample rate can actually capture. When 
this happens, the successive samples won't be able to accurately capture these 
higher frequencies, and will instead actually record false or "alias" frequencies 
that aren't actually there, but will be heard as harmonic distortion (Figure 6.7). 

From a practical point of view, this would mean that an audio signal with a 
bandwidth of 20 kHz would require that the sampling rate be at least 40 kHz 
samples/sec, and a bandwidth of 22 kHz would require a sampling rate of at 
least 44 kHz samples/sec, etc. 

To eliminate the effects of aliasing, a low-pass filter is placed into the circuit 
before the sampling process takes place. In theory, an ideal filter would pass all 
frequencies up to the Nyquist cutoff frequency and then prevent any frequencies 
above this point from passing. In the real world, however, such a "brick wall" 
doesn't exist. For this reason, a slightly higher sample rate must be chosen in 
order to account for the cutoff slope that's required for the filter to be effective 
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FIGURE 6.7 

Frequencies that 
enter into the 
digitization process 
above the Nyquist 
half-sample 
frequency limit can 
introduce harmonic 
distortion: 

(a) frequencies 
greater than 2x the 
sampling rate limit; 

(b) resulting “alias” 
frequencies that are 
introduced into the 
audio band as 
distortion. 


sampling Nyquist Frequency sample rate 


passed frequencies 
above the Nyquist limit 


|_J 

0 Hz 22.05k 44.1k 

(a) 


sampling Nyquist Frequency sample rate 



(b) 


(Figure 6.8). As a result, an audio signal with a bandwidth of 20 kHz will actu- 
ally be sampled at a standardized rate of 44.1 samples/sec, while a bandwidth 
of roughly 22 kHz would require the use of a sampling rate of at least 48 kHz 
samples/sec, etc. 

OVERSAMPLING 

This sampling-related process is commonly used in professional and consumer 
digital audio Systems to improve the Nyquist filter's anti-aliasing characteristics. 

Oversampling increases the effective sample rate by factors ranging between 12 
and 128 times the original rate. There are three main reasons for this process: 

■ Nyquist filters can be expensive and difficult to properly design. By increas- 
ing the effective sample bandwidth, a simpler and less-expensive filter can 
be used. 

■ Oversampling generally results in a higher-quality analog-to-digital (A/D) 
and digital-to-analog (D/A) converter that sounds better. 
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cut-off 
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FIGURE 6.8 

Anti-alias filtering. 

(a) An ideal filter 
would have an 
infinite attenuation at 
the 20-kHz Nyquist 
cutoff frequency. 

(b) Real-world filters 
require an additional 
frequency “guard- 
band” in order to 
fully attenuate 
unwanted frequen- 
cies that fall above 
the half-bandwidth 
Nyquist limit. 


■ Since múltiple samples are taken of a single sample-and-hold analog 
voltage, the average noise sampling will be lower. 

Following the sample stage, the sampled data is digitally scaled back down to 
the target data rate and bandwidth for further processing and/or storage. 

SIGNAL-TO-ERROR RATIO 

The signal-to-error ratio is used to measure the quantization process. A digital 
system's signal-to-error ratio is closely akin (although not identical) to the analog 
concept of signal-to-noise (S/N) ratio. Whereas a S/N ratio is used to indicate 
the overall dynamic range of an analog system, the signal-to-error ratio of a 
digital audio device indicates the degree of accuracy that's used to capture a 
sampled level and its step-related effects. 

Although analog signáis are continuous in nature, as we've read, the process of 
quantizing a signal into an equivalent digital word isn't. Since the number 
of discrete steps that can be encoded within a digital word limits the accuracy 
of the quantization process, the representative digital word can only be an 
approximation (albeit an extremely cióse one) of the original analog signal 
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level. Given a properly designed system, the signal-to-error ratio for a signal 
coded with n bits is: 


Signal-to-error ratio = 6n + 1.8(dB) 


Therefore, the theoretical signal-to-error ratio for the most 
common bit rates will yield a dynamic range of: 

8-bit word = 49.8 dB 
16-bit word = 97.8 dB 


20-bit word = 121.8 dB 
24-bit word = 145.8 dB 
32-bit word = 193.8 dB 
64-bit word = 385.8 dB 


DITHER 

Also relating to quantization, dither is commonly used during the recording or 
conversión process to increase the overall bit resolution (and therefore low-level 
noise and signal clarity) of a recorded signal, when converting from a higher to 
a lower bit rate. 


FIGURE 6.9 

Valúes falling below 
the least significant 
bit level cannot be 
encoded without the 
use of dither. 


Technically, dither is the addition of very small amounts of randomly generated 
noise to an existing bitstream that allows the S/N and distortion figures to fall 
to levels that approach their theoretical limits. The process makes it possible for 
low-level signáis to be encoded at less than the data's least significant bit level 

(less than a single quantization step, as shown 
in Figure 6.9). You heard that right... by adding 
a small amount of random noise into the A/D 
path, we can actually: 

■ Improve the resolution of the conversión 
process below the least significant bit level. 

■ Reduce harmonic distortion in a way that 
greatly improves the signal's performance. 



The concept of dither relies on the fact that noise 
is random. By adding small amounts of random- 
ization into the quantization process, there is an 
increased probability that the D/A converter will 
be able to guess the least significant bit of a low- 
level signal more accurately. This is due to the 
fact that the noise shapes the detected sample in 
such a way that the sample-and-hold (S/H) cir- 
cuitry can determine the original analog valué 
with greater precisión. 

Dither is often applied to an application or 
process to reduce quantization errors and slight 
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increases in noise and/or fuzziness that could otherwise creep into a bitstream. 
For example, when múltiple tracks are mixed together within a DAW, it's not 
uncommon for digital data to be internally processed at 32- and 64-bit depths. 
In situations like this, dither is often used to smooth and round the data valúes, 
so that the low-level (least significant bit) resolutions won't be lost when they 
are interpolated back to their original target bit depths. 

Dither can also be manually applied to sound files that have been saved at 20- 
and 24-bit depths (or possibly greater by the time you read this). Applications 
and DAW plug-ins can be used to apply dither to a sound file or master mix, 
so as to reduce the effects of lost resolution due to the truncation of least sig¬ 
nificant bits. For example, mastering engineers might experiment with applying 
dither to a high-resolution file before saving or exporting it as a 16-bit final 
master. In this way, noise is reduced and the sound file's overall clarity can be 
increased. 

Fixed- vs. floating-point processing 

Many of the newer digital audio and DAW Systems make use of floating-point 
arithmetic in order to process, mix and output digital audio. The advantage of 
floating- over fixed-point DSP calculations is that the former is able to use 
numeric "shorthand" in order to process a wider range of valúes at any point 
in time. In short, it's able to easily move or "float" the decimal point of a very 
large number in a way that can represent it as a much smaller valué. By doing 
so, the processor is able to internally calcúlate much larger bit depth valúes (i.e., 
32- or 64-bit) with relative ease and increased resolution. 


THE DIGITAL RECORDING/REPRODUCTION 


The following sections provide a basic overview of the various stages within the 
process of encoding analog signáis into equivalent digital data (Figure 6.10a) 
and then converting this data back into its original analog form (Figure 6.10b). 

The recording process 

In its most basic form, the digital recording chain ineludes a low-pass filter, a 
sample-and-hold Circuit, an analog-to-digital converter, circuitry for signal 
coding and error correction. At the input of a digital sampling system, the analog 
signal must be band limited with a low-pass filter so as to stop frequencies that 
are greater than half the sample rate frequeney from passing into the A/D con¬ 
versión circuitry. Such a stopband (anti-aliasing) filter generally makes use of a 
sharp roll-off slope at its high-frequeney cutoff point (oversampling might be 
used to simplify and improve this process). 

Following the low-pass filter, a sample-and-hold (S/H) circuit holds and measures 
the analog voltage level that's present during the sample period. This period is 
determined by the sample rate (i.e., l/44,100th of a second for a 44.1 rate). At 
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FIGURE 6.10 

The digital audio 
Chain: (a) recording; 
(b) reproduction. 



this point, computations are performed to transíate the sampled voltage into 
an equivalent binary word. This step in the A/D conversión is one of the most 
critical components of the digitization process, because the sampled DC voltage 
level must be quickly and accurately quantized into an equivalent digital word 
(to the nearest step level). 

Once the sampled signal has been converted into its equivalent digital form, 
the data must be conditioned for further data processing and storage. This con- 
ditioning ineludes data coding, data modulation and error correction. In general, 
the binary digits of a digital bitstream aren't directly stored onto a recording 
médium as raw data; rather, data coding is used to transíate the data (along 
with synchronization and address information) into a form that allows the data 
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FIGURE 6.11 

Pulse-code 

modulation. 
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FIGURE 6.12 

Example of inter- 
leaved error 
correction. 


to be most efficiently and accurately stored to a memory or storage media. The 
most common form of digital audio data coding is pulse-code modulation, or 
PCM (Figure 6.11). 

The density of stored information within a PCM recording and playback system 
is extremely high ... so much so that any imperfections (such as dust, finger- 
prints or scratches that might adhere to the surface of any magnetic or optical 
recording médium) will cause severe or irretrievable errors to be generated. To 
keep these errors within acceptable limits, several forms of error correction are 
used (depending on the media type). One method uses redundant data in the 
form of parity bits and check codes in order to retrieve and/or reconstruct lost 
data. A second uses error correction that involves interleaving techniques, 
whereby data is deliberately scattered across the digital bitstream, according to 
a complex mathematical pattern. The latter has the effect of spreading the data 
over a larger surface of the recording media, thereby making the recording 
media less susceptible to dropouts (Figure 6.12). In fact, it's a simple truth that 
without error correction, the quality of most digital audio media would be 
greatly reduced or (in the case of the CD and DVD) rendered almost useless. 

The playback process 

In many respects, the digital reproduction chain works in a manner that's com- 
plementary to the digital encoding process. Since most digital media encodes 
data onto media in the form of highly saturated magnetic transition States or 
optical reflections, the recorded data must first be reconditioned in a way that 
restores the digital bitstream back into its original, modulated binary State 
(i.e., a transitional square wave). Once this is done, the encoded data can be 














































































The Digital Recording/Reproduction Process 


FIGURE 6.13 

A stepped resistance 
network is a common 
device for accom- 
plishing D/A 
conversión by 
assigning each word 
bit to a series of 
resistors that are 
scaled by factors of 2. 
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de-interleaved (reassembled) back into its original form, where it can be con¬ 
verted back into PCM data, and the process of D/A conversión can take place. 

Within the D/A conversión process, a stepped resistance network (sometimes 
called an R/2R network) is used to convert the representative words back into 
their analogous voltage levels for playback. During a complementary S/H 
period, each bit within the representative digital word is assigned to a resistance 
leg in the network (moving from the most significant to the least significant 
bit). Each "step" in the resistance leg is then designed to successively pass one- 
half the reference voltage level as it moves down the ladder toward the LSB 
(Figure 6.13). The presence or absence of a logical "1" in each step is then used 
to turn on each successive voltage leg. As you might expect, when all the resis¬ 
tance legs are added together, their voltages equal a precise level that matches 
the originally recorded voltage during the recorded sample period. As these 
voltages are reproduced over time in precise intervals (as determined by the 
sample rate) ... voilá, you have a playback signal! 

Following the conversión process, a final, complementary low-pass filter is 
inserted into the signal path. Again, following the principie of the Nyquist 
theorem, this filter is used to smooth out any of the step-level distortions that 
are introduced by the sampling process, resulting in a waveform that faithfully 
represents the originally recorded analog waveform. 

Sound file sample rates 

The sample rate of a recorded digital audio bitstream directly relates to the reso- 
lution at which a recorded sound will be digitally captured. Using the film 
analogy, if you capture more samples (frames) of a moving image as it moves 
through time, you'll have a more accurate representation of that recorded event. 
If the number of samples are too low, the resolution will be "lossy" and will 
distort the event. On the other hand, taking too many picture frames might 
result in a recorded bandwidth that's so high that the audience won't be able 
to discriminate any advantages that the extra information has to offer ... or the 
storage requirements will become increasingly large as the bandwidth and file 
size increase. 

This analogy relates perfectly to audio because the choices of sample rate will 
be determined by the bandwidth (number of overall frequencies that are to be 
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captured) versus the amount of storage is needed to either save the data to 
a memory storage media ... or the time that will be required to up/download 
a file through a transmission and/or online datastream. Beyond the basic 
adherence to certain industry sample rate standards, such are the personal 
decisions that must be made regarding the choice of sample rate to be used 
on a project. Although other sample-rate standards exist, the following are 
the most commonly used in the professional, project and audio production 
community: 

■ 32k: This rate is often used by broadcasters to transmit/receive digital data 
via satellite. With its overall 15-kHz bandwidth and reduced data require- 
ments, it is also used by certain devices in order to conserve on memory 
and is commonly used in satellite broadcast Communications. Although 
the pro community doesn't generally use this rate, it's surprising just how 
good a sound can be captured at 32k (given a high-quality converter). 

■ 44. lk: The long-time standard of consumer and pro audio production, 
44.1 is the chosen rate of the CD-audio standard. With its overall 20-kHz 
bandwidth, the 44.lk rate is generally considered to be the minimum 
sample rate for professional audio production. Assuming that high-qual¬ 
ity converters are used, this rate is capable of recording lossless audio, 
while conserving on memory storage requirements. 

■ 48k: This standard was adopted early on as a standard sample rate for 
professional audio applications (particularly when referring to hardware 
digital audio devices). It's also the adopted standard rate for use within 
professional video and DVD production. 

■ 88.2k: As a simple múltiple of 44.1, this rate is often used within produc- 
tions that are intended to be high-resolution producís. 

■ 96k: This rate has been adopted as the de facto sample rate for high- 
resolution recordings. 

■ 192k: This high-resolution rate is uncommon within pro audio produc¬ 
tion, as the storage and media requirements are quite high. 

Sound file bit rates 

The bit rate of a digitally recorded sound file directly relates to the number of 
quantization steps that are encoded into the bitstream. As a result, the bit rate 
(or bit depth) is directly correlated to the: 

■ Accuracy at which a sampled level (at one point in time) is to be encoded 

■ Signal-to-error figure ... and thus the overall dynamic range of the recorded 
signal. 

If the bit rate is too low to accurately encode the sample, the resolution will 
lead to quantization errors, which will lead to distortion. On the other hand, 
too high of a bit depth might result in a resolution that's so high that the result- 
ing gain in resolution is lost on the audience's ability to discriminate it ... or 
the storage requirements might become so high that the files become inordi- 
nately large. 
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Although other sound file bit rate standards exist, the following are the 
most commonly used within the pro, project and general audio production 
community: 

m 16 bits: The long-time standard of consumer and professional audio pro¬ 
duction, 16 bits is the chosen bit depth of the CD-audio standard (offer- 
ing a theoretical dynamic range of 97.8 dB). It is generally considered to 
be the minimum depth for high-quality professional audio production. 
Assuming that high-quality converters are used, this rate is capable 
of lossless audio recording, while conserving on memory storage 
requirements. 

■ 20 bits: Before the 24-bit rate carne onto the scene, 20 bits was considered 
to be the standard for high-bit-depth resolution. Although it's used less 
commonly, it can still be found in high-definition audio recordings (offer- 
ing a theoretical dynamic range of 121.8 dB). 

■ 24 bits: Offering a theoretical dynamic range of 145.8 dB, this standard 
bit rate is often used in high-definition audio applications, usually in 
conjunction with the 96k sample rate (i.e., 96/24). 

Further reading on sound file and compression codee specifics can be found in 
Chapter 10 (Multimedia). 

Regarding digital audio levels 

Over the past few decades, the trend toward making recordings that are as loud 
as possible has totally pervaded the industry to the point that it has be given 
the ñame of "The Loudness War." Not only is this practice used in mastering 
to make a song or project stand out in an on-air or in-the-pocket playlist ... it 
has also followed in the analog tradition of recording a track as hot as possible 
to get the best noise figures and punch. All of this is arguably well and good, 
except for the fact that in digital recording a track at too "hot" a level doesn't 
add extra punch—it just adds really nasty distortion. 

The dynamic range of a digital recording ranges from its "footroom" (00000000 
00000000) for a 16-bit recording to its full-scale headroom ceiling of (11111111 
11111111). Average or peak levels above full scale can easily ruin a recording. As 
such, since digital has a wider dynamic range than analog, it's always a good 
idea to reduce your levels so that they peak from -12 to -20 dB. This will accu- 
rately capture the peaks without clipping, without introducing an appreciable 
amount of noise into the mix. Recording at higher bit rates (i.e., 24 bits) will 
further reduce noise levels, allowing for increased headroom when recording at 
reduced levels. Of course, there are no standard guidelines or reference levels 
(digital meters aren't even standardized, for the most part) ... so you might 
want to further research the subject on your own. 

Digital audio transmission 

In the digital age, it's become increasingly common for audio data to be dis- 
tributed throughout a connected production system in the digital domain. In 
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this way, digital audio can be transmitted in its original numeric form and (in 
theory) without any degradation throughout a connected path or system. When 
looking at the differences between the distribution of digital and analog audio, 
it should be kept in mind that, unlike its counterpart, the transmitted band- 
width of digital audio data occurs in the megahertz range; therefore, the trans- 
mission of digital audio actually has more in common with video signáis than 
the lower bandwidth range that's encountered within analog audio. This means 
that care must be exercised to ensure that impedance is more closely matched 
and that quick-fix Solutions don't occur (for example, using a Y-cord to split a 
digital signal between two devices is a major no-no). Failure to follow these 
precautions could seriously degrade or deform the digital signal, causing 
increased jitter and distortions. 

Due to these tight restrictions, several digital transmission standards have been 
adopted that allow digital audio data to be quickly and reliably transmitted 
between compliant devices. These inelude such protocols as: 

■ AES/EBU 

■ S/PDIF 

■ SCMS 

■ MADI 

■ ADAT lightpipe 

■ TDIF. 

AES/EBU 

The AES/EBU (Audio Engineering Society and the European Broadcast Union) 
protocol has been adopted for the purpose of transmitting digital audio between 
professional digital audio devices. This standard (which is most often referred 
to as simply an AES digital connection) is used to convey two channels of 
interleaved digital audio through a single, three-pin XLR mic or line cable in a 
single direction. This balanced configuration connects pin 1 to the signal 
ground, while pins 2 and 3 are used to carry signal data. AES/EBU transmission 
data is low impedance in nature (typically 110 Q) and has digital burst ampli¬ 
tudes that range between 3 and 10 V. These combined factors allow for a 
máximum cable length of up to 328 feet (100 meters) at sample rates of less 
than 50 kHz without encountering undue signal degradation. 

Digital audio channel data and subcode information are transmitted in blocks 
of 192 bits that are organized into 24 words (with each being 8 bits long). 
Within the confines of these data blocks, two subframes are transmitted during 
each sample period that convey information and digital synchronization codes 
for both channels in an L-R-L-R ... fashion. Since the data is transmitted as a 
self-clocking biphase code (Figure 6.14), wire polarity can be ignored. In addi- 
tion, whenever two devices are directly connected, the receiving device will 
usually derive its reference timing dock from the digital source device. 

In the late 1990s, the AES protocol was amended to inelude the "stereo 96k 
dual AES signal" protocol. This was created to address signal degradations that 



The Digital Recording/Reproduction Process 


FIGURE 6.14 

AES/EBU subframe 
format. 





FIGURE 6.15 

S/PDIF connectors: 

(a) RCA coax 
connection; 

(b) Toslink optical 
connection. 



can occur when running longer cable runs at sample rates above 50 kHz. To 
address the problem, the dual AES standard allows stereo sample rates above 
50 kHz (such as 96/24) to be transmitted over two synchronized AES cables 
(with one cable carrying the L information and the other carrying the R). 

S/PDIF 

The S/PDIF (Sony/Phillips Digital Interface) protocol has been widely adopted 
for transmitting digital audio between consumer digital audio devices and their 
professional counterparts. Instead of using a balanced 3-pin XLR cable, the 
popular S/PDIF standard has adopted the single-conductor, unbalanced phono 
(RCA) connector (Figure 6.15a), which conducís a nominal peak-to-peak 
voltage level of 0.5 V between connected devices, with an impedance of 75 Q. 
In addition to using standard RCA cable connections (Figure 6.15a), S/PDIF 
can also be transmitted between devices using Toslink optical connection lines 
(Figure 6.15b), which are commonly referred to as "lightpipe" connectors. 

As with the AES/EBU protocol, S/PDIF channel data and subcode information 
are transmitted in blocks of 192 bits consisting of 12 words that are 16 bits 
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long. A portion of this information is reserved as a category code that provides 
the necessary setup information (sample rate, copy protection status and so on) 
to the copying device. Another portion is set aside for transmitting audio data 
that's used to relay track indexing information (such as start ID and program 
ID numbers), allowing this relevant information to be digitally transferred from 
the master to the copy. It should be noted that the professional AES/EBU pro- 
tocol isn't capable of digitally transmitting these codes during a copy transfer. 

In addition to transmitting two channels in an interleaved L-R-L-R ... fashion, 
S/PDIF is able to communicate multichannel data between devices. Most com- 
monly, this shows up as a direct 5.1 surround-sound link between a DVD player 
and an audio receiver/amplifier playback system (via either an RCA coax or 
optical connection). 

5CM5 

Initially, certain digital recording devices (such as a DAT recorder) were intended 
to provide consumers with a way to make high-quality recordings for their own 
personal use. Soon after its inception, however, for better or for worse, the 
recording industry began to see this new médium as a potential source of 
lost royalties due to home copying and piracy practices. As a result, the RIAA 
(Recording Industry Association of America) and the former CBS Technology 
Center set out to create a "copy inhibitor." After certain failures and long indus¬ 
try deliberations, the result of these efforts was a process that has come to be 
known as the Serial Copy Management System, or SCMS (pronounced "scums"). 
This protocol was incorporated into many consumer digital devices in order to 
prohibit the unauthorized copying of digital audio at 44.1 kHz (SCMS doesn't 
apply to the making of analog copies). Note also that, due to the demise of the 
DAT format (both as a recording and mass-music distribution médium), SCMS 
has pretty much fallen out of favor in both the consumer and pro audio 
community. 

So, what is SCMS? Technically, it's a digital protection flag that is encoded in 
byte 0 (bits 6 and 7) of the S/PDIF subcode area. This flag can have only one 
of three possible States: 

■ Status 00: No copy protection, allowing unlimited copying and subse- 
quent dubbing 

■ Status 10: No more digital copies allowed 

■ Status 11: Allows a single copy to be made of this product, but that copy 
cannot be copied. 

MADI 

The MADI (Multichannel Audio Digital Interface) standard was jointly pro- 
posed as an AES standard by representatives of Neve, Sony and SSL as a straight- 
forward, clutter-free digital interface connection between multitrack devices 
(such as a digital tape recorder, high-end workstation or mixing consolé, as 
shown in Figure 6.16). The format allows up to 56 channels of linearly encoded 
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FIGURE 6.16 

MADI offers a 
clutter-free digital 
interface connection 
between multitrack 
devices. 


MADI digital transmisión lino 



digital audio to be connected via a single 75-Q, video-grade coaxial cable at 
distances of up to 120 feet (50 meters) or at greater distances whenever a fiber- 
optic relay is used. 

In short, MADI makes use of a serial data transmission format that's compatible 
with the AES/EBU twin-channel protocol (whereby the data, Status, User and 
parity bit structure is preserved), and sequentially cycles through each channel 
(starting with Ch. 0 and ending with Ch. 55). The transmission rate of 100 Mbit/ 
sec provides for an overall bandwidth that's capable of handling audio and 
numerous sync codes at various sample rate speeds (including allowances for 
pitch changes either up or down by 12.5% at rates between 32 and 48 kHz). 

ADAT LIGHTPIPE 

A wide range of modular digital multitrack recorders, audio interface, mic pre 
and hardware devices currently use the Alesis lightpipe system for transmitting 
multichannel audio via a standardized optical cable link. These connections 
make use of standard Toslink connectors and cables to transmit up to eight 
channels of uncompressed digital audio at resolutions up to 24 bit at sample 
rates up to 48k over a sequential, optical bitstream. In what is called the S/MUX 
IV mode, a lightpipe connection can also be used to pass up to four channels 
of digital audio at higher sample rates (i.e., 88.2 and 96k). 

Although these connectors are identical to those that are used to optically trans¬ 
mit S/PDIF stereo digital audio, the datastreams are completely incompatible 
with each other. Lightpipe data isn't bidirectional, meaning that the connection 
can only travel from a single source to a destination. Thus, two cables will be 
needed to distribute data both to and from a device. In addition, synchroniza- 
tion data is imbedded within the digital datastream, meaning that no additional 
digital audio sync connections are necessary in order to lock device timing 
docks; however, transport and timecode information is not transmitted and 
will require additional connections. 
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ADAT ópticaI transir ission lina 



analog l/O 1-8 



analog 1/0 9-16 


FIGURE 6.17 

ADAT optical 1/0 

Interconnection 

drawing. 



FIGURE 6.18 

Octane 8-channel 
preamp with ADAT 
lightpipe. (Courtesy 
of M-Audio, a 
división of Avid 
Technology, Inc., 
www.m-audio.com.) 



FIGURE 6.19 

Behringer UltraGain 
Pro-8 8-channel 
preamp and digital 
converter. (Courtesy 
of Behringer Int’l 
GmbH, www. 
behringer.com.) 


It's interesting to note that, although the Alesis ADAT tape-based format has 
virtually disappeared into the sunset, the lightpipe protocol lives on as a pre- 
ferred way of passing multichannel digital audio to and from a digital audio 
interface/DAW combination (Figures 6.17 through 6.19). For example, it's 
common for a modern multichannel mic pre to have a lightpipe output, allow- 
ing a stream of up to eight digital outs to be easily inserted into a suitable audio 
interface, with a single optical cable, thus increasing the number of inputs by 
eight (or 16, should two ADAT l/O interconnections be available). 

TDIF 

The TDIF (Tascam Digital InterFace) is a proprietary format that uses a 25-pin 
D-sub cable to transmit and/or receive up to eight channels of digital audio 
between compatible devices. llnlike the lightpipe connection, TDIF is a bidirec- 
tional connection, meaning that only one cable is required to connect the eight 
ins and outs of one devi ce to another. Although systems that support TDIF-1 
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cannot send and receive sync information (a sepárate wordclock connection is 
required for that; wordclocks are discussed below), the newer TDIF-2 protocol 
is capable of receiving and transmitting digital audio sync through the existing 
connection, without any additional cabling. 

Signal distribution 

If copies are to be made from a single, digital audio source, or if data is to be 
distributed throughout a connected network using AES/EBU, S/PDIF or MADI 
digital transmission cables, it's possible to daisy chain the data from one device 
to the next in a straightforward fashion (Figure 6.20a). This method works well 
only if a few devices are to be chained together. However, if several devices are 
connected together, time-base errors (known as jitter ) might be introduced into 
the path, with the possible side effects being added noise, distortion and a 
slightly "blurred" signal image. One way to reduce the likelihood of such time- 



FIGURE 6.20 

Digital audio distribution: (a) several devices connected in a daisy-chain fashion; (b) múltiple devices connected using a 
distribution device. 
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base errors is to use a digital audio distribution device that can route the data 
from a single digital audio source to a number of individual device destinations 
(Figure 6.20b). 

What isjitter? 

Jitter is a controversial and widely misunderstood phenomenon. To my knowl- 
edge, it's been explained best by Bob Katz of Digital Domain (www.digido.com, 
Orlando, FL). The following is a brief excerpt of his article "Everything You 
Always Wanted to Know About Jitter But Were Afraid to Ask." Further reading 
on digital audio and mastering techniques can be found in Bob's excellent book, 
Mastering Audio: TheArtand the Science, from Focal Press (www.focalpress.com): 

Jitter is time-base error. ... It is caused by varying time delays in the 
circuit paths from component to component in the signal path. The tino 
most common causes of jitter are poorly designed Phase Locked Loops 
(PLLs) and inaveform distortion due to mismatched impedances and/or 
reflections in the signal path. 

Here is how waveform distortion can cause time-base distortion: The top wave- 
form (Figure 6.21a) represents a theoretically perfect digital signal. Its valué is 
101010, occurring at equal slices of time, represented by the equally spaced 
dashed vertical lines. When the first waveform passes through long cables of 
incorrect impedance, or when a source impedance is incorrectly matched at the 
load, the square wave can become rounded, fast rise times become slow, and 
reflections in the cable can cause misinterpretation of the actual zero Crossing 
point of the waveform. The second waveform (Figure 6.21b) shows some of the 
ways the first might change; depending on the severity of the mismatch you 



(a) 




FIGURE 6.21 

Example of time- 
base errors: (a) a 
theoretically perfect 
digital signal source; 
(b) the same signal 
with jitter errors. 
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might see a triangle wave, a square wave with ringing or simply rounded edges. 
Note that the new transitions (measured at the zero line) in the second wave- 
form occur at unequal slices of time. Even so, the numeric interpretation of the 
second waveform is still 101010! There would have to be very severe waveform 
distortion for the valué of the new waveform to be misinterpreted, which 
usually shows up as audible errors—clicks or tics in the sound. If you hear tics, 
then you really have something to worry about. 

If the numeric valué of the waveform is unchanged, why should we be con¬ 
cerned? Let's rephrase the question: "When (not why) should we become con¬ 
cerned?" The answer is "hardly ever." The only effect of time-base distortion is 
in the listening; as far as it can be proved, it has no effect on the dubbing of 
tapes or any digital-to-digital transfer (as long as the jitter is low enough to 
permit the data to be read; high jitter may result in clicks or glitches as the Circuit 
cuts in and out). A typical D/A converter derives its system dock (the dock that 
Controls the sample and hold Circuit) from the incoming digital signal. If that 
dock is not stable, then the conversions from digital to analog will not occur 
at the correct moments in time. The audible effect of this jitter is a possible loss 
of low-level resolution caused by added noise, spurious (phantom) tones, or 
distortion added to the signal. 

A properly dithered 16-bit recording can have over 120 dB of dynamic range; a 
D/A converter with a jittery dock can deteriórate the audible dynamic range to 
100 dB or less, depending on the severity of the jitter. I have performed listening 
experiments on purist, audiophile-quality musical source material recorded 
with a 20-bit accurate A/D converter (dithered to 16 bits within the A/D). The 
sonic results of passing this signal through processors that trúncate the signal 
at -110, -105 or -96 dB are increased "grain" in the image; instruments losing 
their sharp edges and focus; reduced sound-stage width; and apparent loss of 
level causing the listener to want to turn up the monitor level, even though 
high-level signáis are reproduced at unity gain. Contrary to intuition, you can 
hear these effects without having to turn up the listening volume beyond 
normal (illustrating that low-level ambience cues are very important to the 
quality of reproduction). Similar degradation has been observed when jitter is 
present. Nevertheless, the loss due to jitter is subtle and primarily audible with 
the highest-grade audiophile D/A converters. 

Wordclock 

One aspect of digital audio recording that never seems to get enough attention 
is the need for locking (synchronizing) the sample docks within a connected 
digital audio system to a common timing reference. Left unchecked, it's possible 
that such gremlins as clicks, pops and jitter (oh my!) would make their way 
into the audio chain. Through the use of a single, master timing reference 
known as wordclock, the overall sample-and-hold conversión States during both 
the record and playback process for all digital audio channels and devices within 
the system will occur at exactly the same point in time. 
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How can you tell when the wordclock between múltiple 
devices either isn’t connected or ¡s improperly set? If you 
hear ticks, cllcks and pops over the monitors that sound 
like “pensive monkeys pounding away at a typewriter,” 


you’ve probably found the problem. When this happens, 
stop—and deal with the issue, as those monkeys will 
almost certainly make ¡t into your recordings. 


To further illustrate, let's assume that we're in a room that has four or five docks 
and none of them read the same time! In places like this, you'll never quite 
know what the time really is ... the docks could be running at different speeds 
or at the same speed but are set to different times. Basically, trying to accurately 
keep track of the time would end up being a jumbled nightmare. On the other 
hand, if all of these docks were locked to a single, master dock (remember 
those self-correcting docks that are installed in most schools?), accurately 
keeping track of the time (even when moving from room to room) would be 
much simpler. 

In effect, wordclock works in much the same fashion. If the sample dock (the 
timing reference that determines the sample rate and DSP traffic control) for 
each device were set to opérate in a freewheeling, internal fashion ... the timing 
references for each device within the connected digital audio chain wouldn't 
accurately match up. Even though the devices are all running at the same sample 
rate, the resulting mismatches in time will often result in clicks, ticks, excessive 
jitter and other unwanted grunge. To correct for this, the internal docks of all 
the digital devices within a connected chain must be referenced to a single 
"master" wordclock timing element (Figure 6.22). 



FIGURE 6.22 

Example of word¬ 
clock distribution 
showing that there 
can be only one 
master dock within 
a digital production 
network. 
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Similar to the distribution of timecode, there can only be one master wordclock 
reference within a connected digital distribution network. This reference source 
can be derived from a digital mixer, soundcard or any desired source that can 
transmit wordclock. Often, this reference pulse is chained between the involved 
devices through the use of BNC and/or RCA connectors, using low-capacitance 
cables (often 75-Q, video-grade coax cable is used, although this cable grade 
isn't always necessary on shorter cable runs). 

It's interesting to note that wordclock isn't generally needed when making 
a digital copy directly from one device to another (via such protocols as 
AES, S/PDIF, MADI or TDIF2), because the timing information is actually 
embedded within the data bitstream itself. Only when we begin to connect 
devices that share and communicate digital data throughout a production 
network do we begin to see the immediate need for wordclock. Note also that 
Digidesign had actually created a special versión of wordclock for Pro Tools 
Systems that ticks off 256 pulses for every sample. Superclock (also known 
generically as 256Fs) is transmitted over the same type and cable as wordclock 
and was eventually abandoned in favor of the industry standard wordclock 
protocol. 

It almost goes without saying that there will often be differences in con- 
nections and parameter setups from one system to the next. In addition to 
proper cabling, impedance and termination considerations throughout the 
network, specific hardware and software setups may be required in order to 
get all the device blocks to communicate. To better understand your par¬ 
ticular system's setup (and to keep frustration to a minimum), it's always a 
good idea to keep all of your device's physical or pdf manuals cióse at 
hand. 


DIGITAL AUDIO RECORDING SYSTEMS 

For the remainder of this chapter, we'll be looking at some of the digital 
audio recording device types that are currently available on the market. From 
my own personal viewpoint, not only do I find the here and now of record¬ 
ing technology to be exciting and full of cost-effective possibilities ... I also 
love the fact that there are lots of recording media and device-type options. In 
other words, a digital hardware or software system that works really well for 
me might not be the best and easiest solution for you! As we take a look 
at many of these device and system choices, I hope that you'll take the time 
to learn about each one (and possibly even try your hand at listening to 
and/or working with each system type). In the earlier years of recording, 
there were only a few ways in which a successful recording could be made. 
Now, in the age of technological options ... your mission (should you 
decide to accept it) is to research and test-drive devices and/or production 
Systems to find the one that best suits your needs, budget and personal 
working style. 
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0 I Y 1 

do it yourself 

Tutorial: Further Research 


Of course, it’s important to point out that many of the 
explanations ¡n this chapter are, by design, basic 
introductions to digital audio technology and its underly- 
ing concepts. To gain greater insight, it’s always best to 
roll up your virtual sleeves and dive into books, maga- 


zines and the Internet. A better understanding of a 
technology or technique is often yours for the asking by 
researching and downloading appropriate archived 
magazine articles and user pdf manuals. 


Samplers 

Historically, one of the first production applications in digital audio carne in 
the form of the drum machine, which gave us the ability to trigger prerecorded 
drum and percussion sounds from a single, dedicated electronic instrument. 
This made it possible for electronic musicians to add percussion samples to 
their own compositions without having to hire out a drummer/percussionist. 
Out of this innovation sprang a major class of sample and synthesis technology 
that has fueled electronic production technology over its history. 

Not long after the introduction of the basic drum machine, devices were devel- 
oped that were capable of recording, musically transposing, processing and 
reproducing segments of digitized audio directly from RAM (random access 
memory) ... and thus, the sampler was born. 

Over several decades, the simple hardware sample playback device has evolved 
into more complex hardware and (more recently) software Systems (Figures 
6.23 and 6.24) that are capable of streaming hundreds of carefully crafted 
samples directly from a Computer s hard disk and RAM memory. 

Assuming that sufficient memory is available, any number of audio samples can 
be loaded into a system in such a way that their playback can be transposed in 



FIGURE 6.23 

Yamaha MOTIF XS6 
keyboard worksta- 
tion. (Courtesy of 
Yamaha Corporation 
of America, www. 
yamaha.com.) 
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real time (either up or down) over a number of octave ranges in a musical 
fashion. Since memory is often (but not always) limited in size, the digital 
sample segments are generally limited in length and range from only a few 
seconds to one or more minutes. Quite simply, these musical transpositions 
occur by reproducing the recorded digital audio segments at sample rates that 
correspond to established musical intervals. These samples can then be assigned 
to a specific note on a MIDI controller or mapped across a keyboard range in 
a multiple-voice (polyphonic) musical fashion. 

Hard-disk recording 

As project and Computer technology matured, advances in hardware and digital 
audio technology began to open the doors for recording audio as longer and 
longer sample files into RAM. It wasn't long after this that Computer technology 
advanced to the point where specialized hardware, software and I/O interfacing 
could be used to record, edit and play back audio directly from a computer's 
hard disk. Thus, the concept of the hard-disk recorder was born. 

A few of the numerous advantages to using a hard-disk recording system in an 
audio production environment inelude: 

■ The ability to handle múltiple samplefdes: Hard-disk recording time is often 
limited only by the size of the disk, while the track count is determined 
by the speed and throughput of the overall system. 

■ Random-access editing: Once audio (or any type of media data) is recorded 
onto a disk, any segment of the program can be instantly accessed at any 
time, regardless of the order in which it was recorded. 

■ Nondestructive editing: A process that allows audio segments (often called 
regions ) to be placed in any context and/or order within a program without 
changing or affecting the originally recorded sound file in any way. Once 
edited, these edited tracks and/or segments can be reproduced to create a 
single, cohesive song or project. 

■ DSP: Digital signal processing can be performed on a sound file and/or 
segment in either real time or non-real time (often in a nondestructive 
fashion). 
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Add to this the fact that computer-based digital 
audio devices can intégrate various media types 
(such as audio, video and MIDI) into a single 
mean, lean production machine that's often easy 
to use, cost effective and time effective ... and 
you have a system that offers the artist and engi- 
neer an unprecedented degree of production 
power. Feel free to check out Chapter 7 for more 
information on the basics of hard-disk recording 
within the digital audio workstation (DAW) 
environment. 


FIGURE 6.25 

Tascam X-48 
multitrack hard-disk 
recorder. (Courtesy of 
Tascam, www. 
tascam.com.) 


Hard-disk multitrack recorders 


Unlike software DAWs that use the graphic user 
interface (GUI) of a personal Computer to offer 
up an overall music production environment, 
some people opt to record using a hard-disk mul¬ 
titrack recording system (Figure 6.25) that 
mimics the basic transport, operational and remóte Controls of a traditional 
multitrack recorder. The basic allure of such a recording system is that it makes 
use of a simple, dedicated multitrack hardware interface in a way that combines 
with the speed, flexibility and nonlinear benefits of a hard-disk-based system. 


Often these devices will incorpórate one or more removable hard-drive bays 
and inelude some form of file compatibility with existing workstation software, 
allowing for direct sound file import, edit and export capabilities from within 


the DAW. 


Portable studios 

Another type of dedicated hardware recording system is the modern-day por¬ 
table studio (often known by Tascam's patented ñame "Portastudio"), which is 
capable of recording to hard disk or solid-state flash memory. These all-in-one 
Systems inelude all of the required hardware and the control system's interface 
to record, edit, mix down and play back a project virtually anywhere when using 
an AC adapter or batteries (Figure 6.26). These hardware devices, which range 
in size, features and track offerings, are often a system of convenience for musi- 
cians, because they inelude all of the necessary mic preamps, mixing surface 
Controls, effeets and integrated CD burning capabilities to simply set up and 
start recording out of the box (Figure 6.27). 

Flash memory handhelds 

Recently, a new class of portable recording device has come onto the scene. 
These devices, known as handhelds (Figure 6.28), are so small that they'll actu- 
ally fit in your pocket. Recording to solid-state flash memory, handhelds often 
inelude a set of built-in high-quality electret-condenser mies and offer many of 
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FIGURE 6.26 

Alesis ¡MultiMix 16 
USB actually makes 
use of an iPod’s hard 
disk for sound file 
access and storage. 
(Courtesy of Alesis, 
www.alesis.com.) 



¿Alesis 


FIGURE 6.27 

Tascam DP-02CF 
compact flash digital 
system has a user 
interface that’s 
virtually identical to 
the classic Portastu- 
dio. (Courtesy of 
Tascam, www. 
tascam.com.) 



the recording and overdub features that you might expect from a larger portable 
recording system, such as a USB link (allowing for easy data transfer or for 
directly turning the device into a USB interface with built-in mies!), built-in 
effeets, microphone emulation (allowing the mic's character to be changed to 
conform to a number of mic styles), a guitar tuner, etc. Certain handhelds allow 
external professional mies to be plugged-in that can be phantom powered, 
while others are designed with four mic capsules that even allow for quad sur- 
round recording. 

In fact, these small, lightweight and powerful recording systems are often of 
surprisingly high-quality, and can be used for any number of purposes, such as: 
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■ Recording a live concert with amazing ease 

■ Recording on-mic sound that can be 
synched up with a video camera or video 
project 

■ Recording dialog (for books-on-tape, etc.) 

■ Easily capturing music and nature samples 

■ Easily capturing samples for film (Foley 
and SFX) 

■ Crude speaker and room calibration (for 
matching L/R and even surround volume 
levels). 

Older technologies 

Although a number of recording technologies 
have come and gone over the years, there are a 
few recording formats that are common enough 
to definitely warrant mentioning. 

MINIDISC 



FIGURE 6.28 

Zoom H4 4-track 
portable recorder. 
(Courtesy of Samson 
Tech, www. 
samsontech.com.) 


Since its introduction by Sony as a consumer 
device in 1992, the MiniDisc (MD) has estab- 

lished itself as a viable médium for recording and storing CD, MP3 tracks and 
original recordings (Figure 6.29). Based on established rewritable magneto- 
optical (MO) technology, the 64-mm disc system has a standard recording 
capacity of up to 74 minutes (effectively the same record/play time as the CD, 
at one-quarter the size). This extended record/play time is due to a compression 
codee known as Adaptive Transform Acoustic Coding (ATRAC), which makes 



FIGURE 6.29 

Portable MiniDisc 
recorder. 
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use of an encoding scheme that reduces data by eliminating or reducing data 
that has been deemed to be imperceptible by the average listener (in much the 
manner of MP3 and other compression codees). Probably the biggest selling 
point of the MiniDisc recording médium is its portability, although this has 
been challenged by the introduction of the previously mentioned flash memory 
handheld recorder. 

THE ROTATING-HEAD DIGITAL AUDIO RECORDER 

Historically, rotating-head technology is the backbone of a wide range of media 
production systems. Not only is it the basis of numerous digital audio tech- 
nologies, such as DAT (digital audio tape) and MDM (modular digital multi- 
track) recording systems—it's the basis of early video technologies, including 
the videocassette and its professional counterparts. For this reason alone, a basic 
introduction and coverage of these technologies is warranted. 

THE ROTARY HEAD 

Even if it's not as relevant in this day and age of solid-state and hard-disk 
memory technologies, the rotary head is still of tremendous importance in a 
number of media-based devices, including the professional and consumer VCR, 
DAT recorders, modular digital multitracks and more. 

Because of the tremendous amount of data density that's required to record and 
reproduce video, broadband and digital audio media, the recording of data onto 
a single, linear tape track is impractical (for example, Vera, the first video 
machine ever made, used 20)4-inch reels of wire that were running at speed of 
200 inches/sec—almost 17 feet/sec—to encode video)! Of course, running a 
recording médium at this rate was out of the question ... so, in order to get 
around this bandwidth limitation, a rotating-head helical sean path was incor- 
porated to effectively increase the recorded data path, while making use of a 
slow-moving tape. 

True to its ñame, this process uses magnetic recording heads that are mounted 
onto a rotating drum that's mounted at a slant with respect to the tape's moving 
axis. The tape is partially wrapped around the drum, so as to make contact with 
the rotating heads. As the head rotates, numerous closely spaced tape "sean" 
paths (or "swipes") are consecutively recorded along the tape path at a slanted 
angle. If we were able to lay these angular tracks out in a straight line, the result- 
ing recorded track would be much longer in length than would be possible with 
a linear track, resulting in a system that can record wider frequeney bandwidths 
at a slow-moving tape speed. Examples of a helical transport that employ rotary 
head technology are shown in Figure 6.30. 

DIGITAL AUDIO TAPE (DAT) SYSTEM 

The digital audio tape, or DAT, recorder is a compact recording médium for 
recording PCM digital audio (Figure 6.31). DAT technology makes use of an 
enclosed compact cassette that's slightly larger than modern miniDV videotapes. 
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FIGURE 6.30 

Helical sean path: 

(a) tape path is 
wrapped around the 
drum mechanism; 

(b) recording tracks 
at a helical “spiraling 
slant” angle. 


DAT recorders are commonly equipped with both analog and digital input/ 
outputs, and the specification allows for recording/playback at three standard 
sampling frequencies: 32, 44.1, and 48 kHz (although sample rate capabilities 
and system features may vary from one recorder to the next). Current DAT tapes 
offer running times of up to 2 hours when sampling at 44.1 and 48 kHz. As 
technology marches on, this tape-based médium continúes to give way to newer 
flash memory and hard-disk-based devices. 
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FIGURE 6.31 

Tascam DA-P1 
portable DAT 
recorder. (Courtesy of 
Tascam, www. 
tascam.com.) 



FIGURE 6.32 

A half-wrap tape 
path, showing a 90 c 
amount of tape-to- 
head contact. 


B h *ad DAT TAPE/TRANSPORT FORMAT 

The actual track width of the DAT format's helical 
sean can range downward to about 1/1 Oth the 
thickness of a human hair (which allows for a 
recording density of 114 million bits per square 
inch). To assist with tape tracking and to main- 
tain the highest quality playback signal, a track¬ 
ing correction system is used to center the heads 
directly over the tape's sean path. The head 
assembly of a DAT recorder uses a 90° half-wrap 
tape path. (Figure 6.32 shows a two-head assem¬ 
bly in which each head is in direct contact with 
the tape 25% of the time.) Such a discontinuous 
signal requires that the digital data be commu- 
nicated to the heads in digital "bursts," which necessitates the use of a digital 
buffer. On playback, these bursts are again smoothed out into a continuous 
datastream. 



THE MODULAR DIGITAL MULTITRACK 

Another device that is less commonly seen in the pro and project setting is the 
modular digital multitrack system or MDM recording system. These historically 
important devices single-handedly ushered professional and project studio pro- 
duction out of the analog age and into the affordable age of the personal project 
studio. This techno explosión occurred because these compact multitrack digital 
audio recorders were capable of recording eight tracks of digital audio onto 
videotape-grade cassette cartridges that could often be bought at your favorite 
neighborhood drugstore. At the time of their introduction, another important 
aspect of the MDM revolution was their price. Amazingly, these devices offered 
digital audio recording quality to the pro and project studio at a cost that was 
often 1/1 Oth the price of a comparable analog machine. MDMs are said to be 
"modular" because several of them can be linked together (in groups of eight) 
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FIGURE 6.33 

Tascam DA-98HR 
modular digital 
multitrack recorder. 
(Courtesy of Tascam 
www.tascam.com.) 


in a synchronous fashion that allows them to work in tándem as a large-scale 
multitrack recording system (often configured in the 24- and 32-track range). 

ADATMDM FORMAT 

The ADAT standard, which was created by the Alesis Corporation, is no longer 
available on the market (although, as was previously mentioned, the 8-channel 
ADAT optical transmission link is still going strong). These rotary-head, eight- 
track modular digital multitrack recorders made use of standard S-VHS video¬ 
tape to record PCM audio at 44.1k/48k at 16-/20-bit word depths. 

DTR5 MDM FORMAT 

Created by Tascam, the rotary-head Digital Tape Recording System (DTRS; see 
Figure 6.33) is capable of recording up to 108 minutes of digital audio onto a 
standard 120-minute Hi-8-mm videotape, and (like the ADAT format) can be 
combined with other DTRS recorders to create a system that has 24, 32 or more 
tracks. Digital I/O is made through the proprietary TDIF connection, which uses 
a special 25-pin D-sub connector to link to other DRTS recorders, digital mixers, 
hard-disk recorders or external accessories. The TDIF digital interconnection 
comes in two flavors: 

■ TDIF-1 is capable of transmitting and receiving all eight channels of digital 
audio (in a bidirectional I/O fashion) over a single cable 

■ TDIF-2 is capable of transmitting eight channels of digital audio, as well 
as sync data without the need for an external connection. 
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CHAPTER 7 


The Digital Audio 
Workstation 


Over the history of digital audio production, the style, form and function of 
hard-disk recording has changed to meet the challenges of faster processors, 
bigger drives, improved hardware Systems and the ongoing push of marketing 
forces to sell, sell, sell! As a result, there are numerous hard-disk system types 
that are designed for various purposes, budgets and production styles. As new 
technologies and programming techniques continué to turn out new hardware 
and software Systems at a staggering pace, many of the long-held production 
limitations have vanished as increased track counts, processing power and 
affordability have changed the way we see the art of production itself. In recent 
years, no single term signifies these changes more than that of the "DAW." 

In recent years, the digital audio workstation (DAW) has come to signify an inte- 
grated computer-based hard-disk recording system that commonly offers a wide 
and ever-changing number of production features such as: 

■ Advanced multitrack recording, editing and mixdown capabilities 

■ MIDI sequencing, edit and score capabilities 

■ Integrated video and/or video sync capabilities 

■ Integration with peripheral hardware devices such as controllers, MIDI 
and audio interface devices 

■ Plug-in DSP (digital signal processing) support 

■ Support for plug-in virtual instruments 

■ Support for integrating timing and control elements with other produc¬ 
tion software (ReWire). 

Truth of the matter is ... by offering a staggering amount of production power 
for the buck, these software-based programs (Figures 7.1 through 7.3) and their 
associated hardware devices have revolutionized the faces of professional, 
project and personal studios in a way that touches almost every life within the 
audio and music production communities. 

INTEGRATION NOW... INTEGRATION FOREVER! 

Throughout the history of music and audio production, we've grown used to 
the idea that certain devices were only meant to perform a single task: A recorder 
records and plays back, a limiter limits and a mixer mixes. Fortunately, the age 



Integration Noui... Integration Forever! 
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FIGURE 7.1 

Pro Tools hard-disk editing workstation for the Mac® or PC. (Courtesy of Digidesign, a división of Avid, www.digidesign. 
com.) 
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FIGURE 7.2 

Cubase 4 Media Production System for the Mac or PC. (Courtesy of Steinberg Media Technologies GmbH, www. 
steinberg.net.) 


of the microprocessor has totally broken down these traditional lines in a way 
that has created a breed of digital chameleons that can change their functional 
colors as needed to match the task at hand. Along these same lines, the digital 
audio workstation isn't so much a device as a systems concept that can perform 
a wide range of audio production tasks with relative ease and speed. Some of 
the characteristics that can (or should be) offered by a DAW inelude: 

■ Integration: One of the biggest features of a workstation is its ability 
to provide centralized control over the digital audio recording, editing, 
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FIGURE 7.3 

Logic DAW for the 
Mac. (Courtesy of 
Apple Inc., www. 
apple.com.) 


Processing and signal routing functions within the production system. It 
should also provide for direct Communications with production-related 
hardware and software Systems, as well as transport and sync control to/ 
from external media devices. 

■ Communication: A DAW should be able to communicate and distribute 
pertinent audio, MIDI and automation-related data throughout the con- 
nected network system. Digital timing (wordclock) and synchronization 
(SMPTE timecode and/or MTC) should also be supported. 

■ Speed and flexibility: These are probably a workstation's greatest assets. After 
you've become familiar with a particular system, most production tasks 
can be tackled in far less time than would be required using similar analog 
equipment. Many of the extensive signal processing, automation and 
systems Communications features would be far more difficult to accom- 
plish in the analog domain. 

■ Automation: Because all of the functions are in the digital domain, the 
ability to instantly save and recall a session and to instantly undo a per- 
formed action becomes a relatively simple matter. 

■ Expandability: Most DAWs are able to intégrate new and important hard¬ 
ware and software components into the system with little or no difficulty. 

■ User-friendly operation: An important element of a digital audio worksta¬ 
tion is its ability to communicate with its central interface unit: you! The 
operation of a workstation should be relatively intuitive and shouldn't 
obstruct the Creative process by speaking too much "computerese." 

Vm sure you've gathered from the above that a software system (and its associ- 
ated hardware) that's capable of integrating audio, video and MIDI under a 
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single, multifunctional umbrelia can be a major investment, both in financial 
terms and in terms of the time that's spent learning to master the overall 
program environment. When choosing a system for yourself or your facility, be 
sure to take the above considerations into account. Each system has its own 
strengths, weaknesses, and particular ways of working. When in doubt, it's 
always a good idea to research the system as much as possible before commit- 
ting to it. Feel free to contact your local dealer for a salesroom test drive. As 
with a new car, purchasing a DAW can be an expensive proposition that you'll 
probably have to live with for a while. Once you've taken the time to make the 
right choice, you can get down to the business of making music. 


DAW HARDWARE 

Keeping step with the modern-day truism "technology marches on," the hard¬ 
ware and software specs of a Computer and the connected peripherals continué 
to change at an ever-increasing pace. This is usually reflected as general improve- 
ments in such areas as their: 

■ Need for speed (múltiple processors) 

■ Increased computing power 

■ Increased disk size and speed 

■ Increased memory size and speed 

■ Operating system (OS) and peripheral integration 

■ General connectivity (networking and the Web). 

In this day and age, it's definitely important that you keep step with the ever- 
changing advances in computer-related production technology. That's not to say 
you need to update your system every time a new hard- or soft-whizbang comes 
on the market. On the contrary, it's often a wise person who knows when a 
system is working just fine for his or her own personal needs and who does the 
research to update software and fine-tune the system (to the best of his or her 
ability). On the other hand, there will come a time (and you'll know all too 
well when it arrives) that this "march" of technology will dictate a system change 
to keep you in step with the times. As with almost any aspect of technology, 
the best way to judge what will work best for you and your system is to research 
any piece of hardware that you're considering ... quite simply, read the specs, 
reads the reviews, ask your friends and then make your best, most informed 
choice. 

When buying a Computer for audio production, one of the most commonly 
asked questions is "Which one ... Mac or PC?" The actual answer as to which 
operating system (OS) will work best for you will actually depend on: 

■ Your preference 

■ Your needs 

■ The kind of software you currently have 

■ The kind of Computer platform and software your working associates or 
friends have. 
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Any of these factors can be a determining factor—cost, the fact that you already 
have lots of PC or Mac software, the fact that you grew up with and are comfort- 
able with a particular OS. Even this particular question is being sidestepped 
with the advent of Apple's Boot Camp, which allows a Mac to boot up under 
the Mac or Windows OS, giving you freedom of choice. 

Once you've decided which side of the platform tracks you'd like to live on, the 
more important questions that you should be asking are: 

■ Is my Computer fast and powerful enough for the tasks at hand? 

■ Does it have enough hard disks that are large and fast enough for my 
needs? 

■ Is there enough random access memory (RAM)? 

■ Do I have enough monitor space (real estáte) to see the important things 
at a glance? 

On the "need for speed" front, it's always a good idea to buy (or build) a Com¬ 
puter at the top of its performance range at any given time. Keeping in mind 
that technology marches on, the last thing that you'll want to do is buy a new 
Computer only to soon find out that it's underpowered for the tasks ahead. 

The newer dual, quad and eight-core (multiprocessor) systems allow for faster 
calculations. Their tasks are spread across múltiple CPUs; for example, a number 
of DAWs allow for their odd/even track counts to be split across múltiple CPUs 
to increase the overall processing load, for added track count and DSP capa- 
bilities. 

With today's faster and higher capacity serial ATA, IDE and SCSI drives, it's a 
simple matter to install cost-effective drives (each with a capacity of hundreds 
of gigabytes) into a system. These drives can be internal, or they can be installed 
in portable drive cases (Figure 7.4) that can be plugged into either a FireWire® 



FIGURE 7.4 

Rain Recording 
StormDrive FireWire 
and USB2 external 
drive. (Courtesy of 
Rain Recording, LLC, 
www.rainrecording. 
com.) 
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or USB2 port, making it easy to take your own personal drive with you to the 
studio or on-stage. 

The speed at which the disc platters turn will often affect a drive's access time. 
Modern drives that spin at 7200 rpm or higher with 8 MB or more of internal 
cache memory are often preferable for both audio and video production. Higher 
amounts of buffer memory in the drive itself can also be helpful in improving 
data flow and in freeing up the system for other processing functions. Within a 
production system, it's also good to have a second drive that's strictly dedicated 
to your media files ... this is always a good practice, because data could be 
interrupted or slowed should the main drive need to be accessed by the com¬ 
putéis operating system. 

Regarding random access memory, it's always good to use as much RAM that is 
as fast as you can muster. If a system doesn't have enough RAM, data will often 
have to be swapped to the system's hard drive, which can slow things down and 
affect overall performance. When dealing with video and digital images, having 
a sufficient amount of RAM becomes even more of an issue. 

With regard to system and application software, it's often wise to perform an 
update to keep your system ... well ... up-to-date. This holds true even if you 
just bought the software, because it's often hard to tell how long it's been sitting 
on the shelves—and even if it is brand-spanking new, chances are new revisions 
will still be available. Updates don't come without their potential downfalls; 
given the staggering number of hardware/software system combinations that 
are available, it's actually possible that an update might do more harm than 
good. In this light, it's actually not a bad idea to do a bit of research before 
clicking that update button. Who said all this stuff would be easy? 

Just like there never seems to be enough space around the house or apartment, 
having a single, undersized monitor can leave you feeling cramped for visual 
"real estáte." For starters, a sufficiently large monitor that's capable of working 
at higher resolutions will greatly increase the size of your visual desktop; 
however, if one is a good thing, two is always better! Both Windows® and Mac 
OS Systems offer support for dual monitors (Figure 7.5). By adding a commonly 
available "dual-head" video card, your system can easily double your working 
monitor space for less bucks than you might think. I've found that it's truly a 
joy to have your edit window, mixer, effects sections, and transport Controls in 
their own places—all in plain and accessible view. 

The desktop Computer 

Desktop computers are often (but not always) too large and cumbersome to 
lug around. As a result, these systems are most often found as a permanent 
installation in the professional, project and home studio (Figures 7.6 and 7.7). 
Historically, desktops have offered more processing power than their portable 
counterparts, but in recent times, this distinction has become less and less of 
a factor. 
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FIGURE 7.5 

You can never have enough “real estáte”! 



FIGURE 7.6 

The Mac Pro™ with 
Cinema display. 
(Courtesy of Apple 
Computers, Inc., 
www.apple.com.) 


The laptop Computer 

One of the most amazing characteristics of the digital age is miniaturization. 
At the forefront of the studio-a-go-go movement is the laptop Computer (Figures 
7.8 and 7.9). Out of the advent of smaller, lighter and more powerful notebooks 
has come the technological Phoenix of the portable DAW and music perfor¬ 
mance machine. With the advent of USB and FireWire audio interfaces, control- 
lers and other peripheral devices, these systems are now capable of handling 
most (if not all) of the edit and processing functions that can be handled in 
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FIGURE 7.7 

ION Digital Media 
Workstation. 
(Courtesy of Rain 
Recording, LLC, 
www.rainrecording. 
com.) 



FIGURE 7.8 

Rain Recording 
LiveBook™ laptop. 
(Courtesy of Rain 
Recording, LLC, 
www.rainrecording. 
com.) 



the studio. In fact, these AC/battery-powered Systems are often powerful enough 
to handle advanced DAW edit/mixing functions, as well as being able to happily 
handle a wide range of plug-in effects and virtual instruments, all in the comfort 
of ... anywhere! 

That's the good news! Now, the downside of all this portability is the fact that, 
since laptops are optimized to run off of a battery with as little power drain as 
possible, their: 
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FIGURE 7.9 

The MacBook Pro. 
(Courtesy of Apple 
Computers, Inc., 
www.apple.com.) 


■ Processors will often run slower, so as to conserve on battery power. 

■ BIOS (the important subconscious brains of a Computer) might be differ- 
ent (again, especially with regarás to battery-saving features). 

■ Hard drives might not spin as fast (generally they're shipped with 5400- 
rpm speed drives). 

■ Video display capabilities are sometimes limited when compared to a 
desktop (video memory is often shared with system RAM, reducing graphic 
quality and refresh rate). 

■ Internal audio interface usually isn't so great (but that's why there are so 
many external interface options). 

Although the central processing unit (CPU) will often run more slowly than a 
desktop (usually to reduce power consumption in the form of heat), most 
modern laptops are more than powerful enough to perform on the road. For 
this reason, it's always best to get a high-quality system with the fastest CPU(s) 
that you can afford. 

Most often, the primary problems with a laptop lie in their basic BIOS and 
OS battery-saving features. When it comes to making music with a laptop, 
there actually is a real difference between the Mac and a PC. Basically, there's 
little difference between a Mac laptop and a Mac tower, because the BIOSs 
are virtually identical. Conversely, the BIOS of a laptop PC is often limited 
in power and functional capabilities when compared to a desktop. When 
shopping for a PC laptop, it's often good to research how a particular BIOS 
chipset will work for music, particularly with regard to certain audio interface 
devices. In this instance, it's always good to research and read up on the laptop 
that's caught your fancy before you buy (also, keep in mind that the store's 
return policy might buy you time to take it for a technological spin before 
committing). 

Both PC and Mac laptops have an automatic power-saving feature (called "speed 
step" and "processor cycling," respectively) that changes the CPU's speed in 
much the same way that a vehicle changes gears in order to save energy. Often 
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these gear changes can wreak havoc on many of the DSP functions of an audio 
workstation. Turning them off will greatly impro ve performance, at the expense 
of reduced battery life. 

Hard-drive speeds on a laptop are often limited when compared to a desktop 
Computer, resulting in slower access times and fewer track counts on a multi- 
track DAW. Even though these speeds are often more than adequate for general 
music applications, speeds can often be improved through the use of a 7200- 
rpm or faster external USB2 or FireWire drive. 


Again, when it comes to RAM, it's often good to pack as much into the laptop 
as you can. This will reduce data swapping to disk when using larger audio 
applications (especially software synths and samplers). Within certain systems, 
the laptop's video card capabilities will run off of the system's RAM (which can 
severely limit audio processing functions in a DAW). Memory-related problems 
can also crop up when using a motherboard to run a dual-monitor (LCD and 
external monitor) setup. 


FIGURE 7.10 

Studio Pack Mobile 
Laptop Studio 
Backpack. (Courtesy 
of M-Audio, a 
división of Avid, 
www.m-audio.com.) 


It almost goes without saying that the internal audio quality of most laptops 
ranges from being acceptable to abysmal. As a result, about the only true choice 
is to find an external audio interface that works best for you and your applica¬ 
tions. Fortunately, there's a ton of audio interface choices for connecting via 
either FireWire or USB2, ranging from a simple stereo I/O device to those that 
inelude multitrack audio, MIDI and controller capabilities in a small, on-the-go 
package. 



Accessories and accessorize 

I know it seems like an afterthought, but there's 
a wide range of hardware and travel accessories 
that can help you to take your portable rig on the 
road. Just a small listing ineludes: 

■ Faptop backpacks for storing your Computer 
and gear in a safe, fun case (Figure 7.10) 

■ Instrument cases and covers 

■ Flexible FED gooseneck lights that lets you 
view your keyboard in the dark or on-stage 

■ Faptop DJ stands for raising your laptop above 
an equipment-packed table. 

SYSTEM INTERCONNECTIVITY 

In the not-too-distant past, installing a device 
into a Computer or connecting between Com¬ 
puter systems could be a major hassle. With the 
development of the USB and FireWire protocols 
(as well as the improved general programming 
of hardware drivers), hardware devices such as 
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mice, keyboards, cameras, soundcards, módems, MIDI interfaces, CD and hard 
drives, MP3 players, portable fans, LED Christmas trees and cup warmers ... 
can be plugged into an available port, installed and be up and running in no 
time, (usually) without a hassle. 

With the development of a standardized network protocol, it's now possible to 
link computers together in a way that allows for the fast and easy sharing of 
data throughout a connected system. Using such a system, artists and businesses 
alike can easily share and swap files on the other side of the world, and pro or 
project studios can share sound files and video files using an Ethernet connec- 
tion with relative ease. 


USB 

In recent Computer history, few interconnection protocols have affected our lives 
like the universal serial bus (USB). In short, USB is an open specification for 
connecting external hardware devices to the personal Computer, as well as a 
special set of protocols for automatically recognizing and configuring them. The 
first of the following two speeds are supported by USB 1.0, while all three are 
supported by USB 2.0: 

■ USB 1.0 (1.5 Mbits/sec): A low speed for the attachment of low-cost 
peripherals (such as a joystick or mouse) 

■ USB 1.0 (12 Mbits/sec): For the attachment of devices that require a higher 
throughput (such as data transfer, soundcards, digitally compressed video 
cameras and scanners) 

■ USB 2.0 (480 Mbits/sec): For high-throughput and fast transfer of the 
above applications. 

The basic characteristics of USB inelude: 

■ Up to 127 external devices can be added to a system without having to 
open up the Computer. As a result, the industry is moving toward a "sealed 
case" or "locked-box" approach to Computer hardware design. 

■ Newer operating systems will often automatically recognize and configure 
a basic USB device that's shipped with the latest device drivers. 

■ Devices are "hot pluggable," meaning that they can be added (or removed) 
while the Computer is on and running. 

■ The assignment of system resources and bus bandwidth is transparent to 
the installer and end user. 

■ USB connections allow data to flow bidirectionally between the Computer 
and the peripheral. 

■ USB cables can be up to 5 meters in length (up to 3 meters for low-speed 
devices) and inelude two twisted pairs of wires, one for carrying signal 
data and the other pair for carrying a DC voltage to a "bus-powered" 
device. Those that use less than 500 milliamps (1/2 amp) can get their 
power directly from the USB cable's 5-V DC supply, while those having 
higher current demands will need to be externally powered. 
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FIGURE 7.11 

Example of a 4-port 
USB audio hub. 
(Courtesy of Griffin 
Technology, www. 
griffintechnology. 
com.) 



■ Standard USB cables have different connectors at each end. For example, 
a cable between the PC and a device would have an "A" plug at the PC 
(root) connection and a "B" plug for the devices receptacle. 

Cable distribution and "daisy-chaining" are done via a data "hub" (Figure 7.11). 
These devices act as a traffic cop in that they cycle through the various USB 
inputs in a sequential fashion, routing the data into a single data output line. 
It should be noted that not all hubs are created equal. In certain situations, the 
chipset that's used within a hub might not be compatible with certain MIDI 
and audio interface Systems. Should a connection problem arise, contact the 
manufacturer of your audio-related device for advice. 

FireWire 

Originally created in the mid-1990s by Apple (and later standardized as IEEE- 
1394), the FireWire protocol is similar to the USB standard in that it uses 
twisted-pair wiring to communicate bidirectional, serial data within a hot- 
swappable, connected chain. Unlike USB (which can handle up to 127 devices 
per bus), up to 63 devices can be connected within a connected FireWire chain. 
FireWire supports two speed modes: 
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■ FireWire 400 or IEEE-1394a (400 Mbits/sec) is capable of delivering data 
over cables up to 4.5 meters in length. FireWire 400 is ideal for commu- 
nicating large amounts of data to such devices as hard drives, video cam- 
corders and audio interface devices. 

■ FireWire 800 or IEEE-1394b (800 Mbits/sec) can communicate large 
amounts of data over cables up to 100 meters in length. When using fiber- 
optic cables, lengths in excess of 90 meters can be achieved in situations 
that require long-haul cabling (such as sound stages and studios). 

llnlike USB, compatibility between the two modes is mildly problematic, 
because FireWire 800 ports are configured differently from their earlier prede- 
cessor ... and therefore require adapter cables to ensure compatibility. 

Networking 

Beyond the concept of connecting external devices to a single Computer, another 
concept hits at the heart of the connectivity age— networking. The ability to set 
up and make use of a local area network (LAN) can be extremely useful in the 
home, studio and/or office, in that it can be used to link múltiple computers 
with various data, platforms and OS types. In short, a network can be set up in 
a number of different ways, with varying degrees of complexity and administra- 
tive levels. There are two common ways that data can be handled over a LAN 
(Figure 7.12): 

■ The first is a system whereby the data that's shared between linked com¬ 
puters resides on the respective computers and is communicated back and 
forth in a decentralized manner. 

■ The second makes use of a centralized Computer (called a server ) that uses 
an array of high-capacity hard drives to store "all" of the data that relates 
to the everyday production aspects of a facility. Often, such a system will 
have a redundant set of drives (RAID) that actually clones the entire 
system on a moment-to-moment basis as a safety backup procedure. In 
larger facilities where data integrity is critical, a set of backup tapes may 
be made on a daily basis for extra insurance and archival purposes. 

No matter what level of complexity is involved, some of the more common uses 
for working with a network connection inelude: 

■ Sharing files: Within a connected household, studio or business, a LAN can 
be used to share files, sound files, video images—virtually anything— 
throughout the connected facility. This means that various productions 
rooms, studios and offices can simultaneously share and swap data and/ 
or media files in a way that's often transparent to the users. 

■ Shared Web connection: One of the cooler aspects of using a LAN is the 
ability to share an Internet connection over the network from a single, 
connected Computer or server. The ability to connect from any Computer 
with ease is just another reason why you should strongly consider wiring 
your studio and/or house with LAN connections. 


System Interconnectivity 




FIGURE 7.12 

Local area network 
(LAN) connections. 

(a) Data can be 
shared between 
independent 
computers in a home 
or workplace LAN 
environment. 

(b) Computer 
termináis may be 
connected to a 
centralized server, 
allowing data to be 
stored, shared and 
distributed from a 
central location. 
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■ Archiving and backup: In addition to the benefits of archiving and backing 
up data with a server system ... even the simplest LAN can be a true life- 
saver. For example, let's say that we need to make a backup DVD of a 
session but don't have the time to tie up our production DAW. In this 
situation, we could simply burn the disc on another Computer that's con- 
nected to the system, and continué working away, without interruption. 

■ Accessing sound files and sample librarles: It goes without saying that sound 
and sample files can be easily accessed from any connected Computer.... 
Actually, if you're wireless (or have a long enough cable), you can go out 
to the pool and soak up the sun while working on your latest project! 

On a final note, those who are unfamiliar with networking are urged to learn 
about this powerful and easy-to-use data distribution and backup system. For 
a minimal investment in cables, hubs and educational reading, you might be 
surprised at the time, trouble and life-saving benefits that will be almost instantly 
realized. 

THE AUDIO INTERFACE 

An important device that deserves careful consideration when putting together 
a DAW-based production system is the digital audio interface. These devices can 
have a single, dedicated purpose, or they might be multifunctional in nature— 
in either case, their main purpose in the studio is to act as a connectivity bridge 
between the outside world of analog audio and the Computer's inner world of 
digital audio (Figures 7.13 through 7.17). Audio interfaces come in all shapes, 
sizes and functionalities; for example, an audio interface can be: 

■ Built into a Computer (although, more often than not, these devices are 
often limited in quality and functionality) 

■ A simple, two-I/O audio device 

■ Multichannel, offering eight analog I/Os and numerous I/O expansión 
options 

■ Fitted with one or more MIDI I/O ports 

■ One that offers digital I/O, wordclock and sync options 
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FIGURE 7.13 

Presonus AudioBox 
USB 2 x 2 USB 
record i ng system. 
(Courtesy of 
Presonus Audio 
Electronics, Inc., 
www.presonus.com.) 
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FIGURE 7.14 

M-Audio Fast Track 
Ultra High-Speed 
8 x 8 USB 2.0 
interface with MX 
core DSP technology. 
(Courtesy of M-Audio, 
a división of Avid, 
www.m-audio.com.) 
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FIGURE 7.15 

MOTU 828mk3 FireWire audio interface with effects. (Courtesy of MOTU, Inc., www.motu.com.) 


FIGURE 7.16 

Mackie Onyx 1200F 
studio recording 
preamp and 192-kHz 
FireWire interface. 
(Courtesy of LOUD 
Technologies Inc., 
www.mackie.com.) 
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FIGURE 7.17 

Pro Tools HD with 192 1/0 interface. (Courtesy of Digidesign, a división of Avid, www.digidesign.com.) 


■ Fitted with a controller surface (with or without motorized faders) that 
provides for hands-on DAW operation. 

These devices may be designed as hardware cards that fit directly into the Com¬ 
puter, or they might plug into the system via USB or FireWire. An interface might 
have as few as two inputs and two outputs, or it might have as many as 24. It 
might offer a limited number of sample rate and bit-depth options, or it might 
be capable of handling rates up to 96 kHz/24 bits or higher. Unless you buy a 
system that's been designed to opérate with a specific piece of hardware (most 
notably, those offered by Digidesign), you should weigh the vast number of 
interface options and capabilities with patience and care—the options could 
easily affect your future expansión choices. 

Audio driver protocols 

Audio driver protocols are software programs that set standards for allowing 
data to be communicated between the system's software and hardware. A few 
of the more common protocols are: 

■ WDM: The Windows Driver Model is a robust driver implementation 
that's directly supported by Microsoft Windows. Software and hardware 
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that conform to this basic standard can communicate audio to and from 
the computer's basic audio ports. 

■ ASIO: The Audio Stream Input/Output architecture forms the backbone 
of VST. It does this by supporting variable bit depths and sample rates, 
multichannel operation, and synchronization. This commonly used pro- 
tocol offers low latency, high performance, easy setup and stable audio 
recording within VST. 

■ MAS: The MOTU Audio System is a system extensión for the Mac that uses 
an existing CPU to accomplish multitrack audio recording, mixer, bussing 
and real-time effects processing. 

■ CoreAudio: This driver allows compatible single-client, multichannel appli- 
cations to record and play back through most audio interfaces using Mac 
OS X. It supports full-duplex recording and playback of 16-/24-bit audio 
at sample rates up to 96 kHz (depending on your hardware and CoreAu¬ 
dio client application). 

In most circumstances, it won't be necessary for you to be familiar with the 
protocols—you just need to be sure that your software and hardware are com¬ 
patible for use with the driver protocol that works best for you. Of course, 
further information can always be found at the respective companies' websites. 

Latency 

When discussing the audio interface as a production tool, it's important that 
we touch on the issue of latency. Quite literally, latency refers to the buildup of 
delays (measured in milliseconds) in audio signáis as they pass through the 
audio circuitry of the audio interface, CPU, internal mixing structure and I/O 
routing chains. When monitoring a signal directly through a computer's signal 
path, latency can be experienced as short delays between the input and moni- 
tored signal. If the delays are excessive, they can be unsettling enough to throw 
a performer off time. For example, when recording a synth track, you might 
actually hear the delayed monitor sound shortly after hitting the keys (not a 
happy prospect). Because we now have faster computers, improved audio 
drivers, and better programming, latency has been reduced to such low levels 
that it's not even noticeable. For example, the latency of a standard Windows 
audio driver can be truly pitiful (upward to 500 ms). By switching to a sup- 
ported ASIO driver and by optimizing the interface/DAW buffer settings to their 
lowest operating size (without causing the audio to stutter), these del ay valúes 
can be reduced down to an unnoticeable range. 

DAW CONTROLLERS 

One of the more common complaints against the digital audio editor and 
workstation environment (particularly when relating to the use of on-screen 
mixers) is the lack of a hardware controller that gives the user access to hands- 
on Controls. In recent years, this has been addressed by major manufacturers 
and third-party companies in the form of a hardware DAW controller interface 
(Figures 7.18 through 7.21). 
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FIGURE 7.18 

Mackie Control 
Universal DAW 
controller. (Courtesy 
of Loud Technolo¬ 
gies, Inc., www. 
mackie.com.) 



FIGURE 7.19 

Tascam FW-1884 
FireWire audio/MIDI 
interface and control 
surface. (Courtesy of 
Tascam, www. 
tascam.com.) 


These controllers generally mimic the design of an audio mixer in that they 
offer slide or rotary gain faders, pan pots, solo/mute and channel select but- 
tons—with the added bonus of a full transport remóte. A channel select button 
is used to actively assign a specific channel to a section that contains a series of 
grouped pots and switches that relate to EQ, effects and dynamic functions. 
These controllers offer direct mixing control over eight input strips at a time. 
By switching between the banks in groups of 8 (1-8, 9-16, 17-24, ...), any 
number of the grouped inputs can be accessed by the virtual hardware mixer. 
These devices will also often inelude software function keys that can be pro- 
grammed to give quick and easy access to the DAW's more commonly used 
program keys. 
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FIGURE 7.20 

Digidesign 003 
factory-integrated 
FireWire interface 
rack and control 
surface. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 



FIGURE 7.21 

Digidesign ICON 
integrated controller/ 
consolé. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 



Controller commands are most commonly transmitted between the controller 
and audio editor via device-specific MIDI SysEx messages. Therefore, in order 
to be able to intégrate a controller into your system, the DAW's current versión 
must be specifically programmed to accept the control codes from a particular 
controller, unless the DAW and controller make use of a new plug-in architec- 
ture that allows compatible devices to be freely connected. Most controller 
surfaces communicate these messages to the DAW host via the easy-to-use USB 
or FireWire protocols. 




























































The Digital Audio Workstation 4APT iR 


Certain controllers also offer all-in-one capabilities that can be straightforward 
and cost-effective devices for first-time buyers. Often, these devices inelude a 
multichannel audio interface, MIDI interface ports, monitor capabilities and 
full controller functions. Others may already have an existing digital mixer that 
can actually be used as a fully functional controller (and in certain circum- 
stances, as a multichannel audio interface) when connected to a DAW host 
program. For these and other reasons, taking the time to research your needs 
and current equipment capabilities can save you both time and money. 


It’s important to note that there are numerous controllers 
from which to choose—and just because others feel that 
the mouse ¡s cumbersome doesn’t mean that you have to 


feel that way; for example, I have several controllers in 
my own studio, but the mouse ¡s still my favorite tool. As 
always, the choice ¡s totally up to you. 


SOUND FILE FORMATS 

An amazingly varied number of sound file formats exist within audio and mul¬ 
timedia production. Here is a list of the most commonly used audio production 
formats that don't use data compression: 

■ Wave (.ivav): The Microsoft Windows format supports both mono and 
stereo files at a variety of bit and sample rates. WAV files contain PCM 
coded audio (uncompressed pulse-code modulation formatted data) that 
follows the Resource Information File Format (RIFF) spec, which allows 
extra user information to be embedded and saved within the file itself. 

■ Broadcast wave (.wav): In terms of audio contení, broadeast wave files are 
the same as regular wave files; however, text strings for supplying addi- 
tional information can be embedded in the file according to a standard- 
ized data format. 

■ Wave64 (. w64): This proprietary format was developed by Sonic Foundry, 
Inc. (now operating under the Sony ñame). In terms of audio quality, 
Wave64 files are identical to wave files, except that their file headers use 
64-bit valúes, whereas wave uses 32-bit valúes. As a result, Wave64 files can 
be considerably larger than standard wave files, and this format is a good 
choice for long recordings (e.g., surround files and file sizes over 2 GB). 

■ Apple AIFF (.aif or .snd): This standard sound file format from Apple sup¬ 
ports mono or stereo, 8-bit or 16-bit audio at a wide range of sample rates. 
Like broadcast wave files, AIFF files can contain embedded text strings. 

■ Sound Designer I and II (.sd and . sd2): Sound Designer is a sound file format 
that's used by Digidesign. SDI was first released in 1985 and can still be 
found on many CD-ROM and sound file dises; it was primarily used to 
store 16-bit, mono samples of short duration (often on the order of 
seconds). In its latest incarnation, SDII can be used to encode 16- or 24-bit 
sound files of any practical length at a variety of sample rates. 
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Format interchange and compatibility 

At the sound file level, most software editors and DAWs are able to read a wide 
range of uncompressed and compressed formats, which can then be saved into 
a new format. At the session level, there are several standards that allow for the 
exchange of data for an entire session from one platform, OS or hardware device 
to another. These inelude the following: 

■ The Advanced Authoring Format (AAF; www.aafassociation.org) is a mul¬ 
timedia file format that's used to exchange digital media and metadata 
between different systems and applications across múltiple platforms. 
Designed by the top media software companies, this format will greatly 
help media creators of all types by allowing them to exchange projeets 
between applications without losing valuable metadata such as fades, 
automation and processing information. 

■ Open Media Framework Interchange (OMFI) is a platform-independent 
session file format intended for the transfer of digital media between dif¬ 
ferent DAW applications; it is saved with an .omf file extensión. OMF (as 
it is commonly called) can be saved in either of two ways: (1) "export all 
to one file," when the OMF file ineludes all of the sound files and session 
references that are included in the session (be prepared for this file to be 
extremely large), or (2) "export media file references," when the OMF file 
will not contain the sound files themselves but will contain all of the ses- 
sion's región, edit and mix settings; effeets (relating to the receiving DAW's 
available plug-ins and ability to transíate effeets routing); and 1/0 settings. 
This second type of file will be small by comparison; however, the original 
sound files must be transferred into the session folders. 

■ Developed by the Audio Engineering Society, the AES31 standard is an 
open file interchange format that was designed to overeóme format incom- 
patibility issues between different software and hardware systems. Trans¬ 
ferred files will retain event positions, mix settings, fades, etc. AES31 makes 
use of Microsoft's FAT32 file system, with broadeast wave as the default 
audio file format. This means that an AES31 file can be transferred to any 
DAW that supports AES31, regardless of the type of hardware and software 
used, as long as the workstation can read the FAT32 file system, broadeast 
wave or regular wave files. 

■ OpenTL is a file exchange format that was developed for Tascam hard-disk 
recording systems. An imported OpenTL project file will contain all audio 
files and edits that were made within the Tascam system, with all events 
positioned correctly in the project window. Conversely, a session can be 
edited and then exported to a disk in the OpenTL format, making it pos- 
sible to transfer all edits and audio files back to the Tascam hard-disk 
device. 

Although these systems for allowing file and session interchangeability between 
workstation types can be a huge time and work saver, it should be pointed out 
that they are more often than not far from perfect. It's not uncommon for files 
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not to line up properly (or load at all), plug-ins can disappear and/or lose their 
settings ... you ñame it, it could happen. As a result, the most highly recom- 
mended and surefire way to make sure that a session will load into any DAW 
platform is to make (print or export) a copy of each track, starting from the 
session's beginning (00:00:00:00) and going to the end of that particular track. 
LJsing this system, all you need to do is load each track into the new worksta¬ 
tion at their respective track beginning points and get to work. 

Of course, the above method won't load any of the plug-in or mixer settings 
(often an interchange problem anyway). Therefore, it's extremely important that 
you properly document the original session, making sure that: 

■ All tracks have been properly named (supplying additional track notes 
and documentation, if needed). 

■ All plug-in ñames and settings are well documented (a screenshot can go 
a long way toward keeping track of these settings). 

■ Any virtual instrument or MIDI tracks are printed to an audio track. (Make 
sure to inelude the original MIDI files in the session, and to document all 
instrument ñames and settings ... again, screenshots can help.) 

■ Any special effeets or automation moves are printed to the particular track 
in question (make sure this is well documented) and you should definitely 
consider providing an additional copy of the track without effeets or 
automation. 

Sound file sample and bit rates 

While the sample rate of a recorded bitstream (samples per second) directly 
relates to the resolution at which a recorded sound will be digitally captured, 
the bit rate of a digitally recorded sound file directly relates to the number of 
quantization steps that are encoded into the bitstream. It's important that these 
rates be determined and properly set before starting a session. Further reading 
on sample and bit rate depths can be found in Chapter 6. Additional info on 
sound files and compression codees can be found in Chapter 10 (see Table 10.2 
for a listing of uncompressed audio bit rate and file sizes). 


DAW SOFTWARE 

Probably one of the strongest playing cards in the modern digital audio deck 
is the digital audio workstation. By their very nature, DAWs (Figures 7.22 
through 7.25) are software programs that intégrate with Computer hardware and 
functional applications to create a powerful and flexible audio production envi- 
ronment. These programs commonly offer extensive record, edit and mixdown 
facilities through the use of such production tools as: 

■ Extensive sound file recording, edit and región definition and placement 

■ MIDI sequencing and scoring 

■ Real-time, on-screen mixing 

■ Real-time effeets 


DAW Software 


FIGURE 7.22 

Sonar 7 Producer 
edition. (Courtesy 
of Cakewalk, Inc., 
www.cakewalk.com.) 
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FIGURE 7.23 

Mark of the Unicom 
Digital Performer. 
(Courtesy of MOTU, 
Inc., www.motu. 
com.) 
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■ Mixdown and effects automation 

■ Sound file import/export and mixdown export 

■ Support for video/picture synchronization 

■ Systems synchronization 

■ Audio, MIDI and sync Communications with other audio programs (e.g., 
ReWire) 
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FIGURE 7.24 

Nuendo 4 media 
production DAW. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 


■ Audio, MIDI and sync Communications with other software instruments 
(e.g., VST technology). 

This list is but a smattering of the functional capabilities that can be offered by 
an audio production DAW. 

Suffice it to say that these powerful software production tools are extremely 
varied in their form and function. Even with their inherent strengths, quirks, 
and complexities, their basic look, feel, and operational capabilities have, to 
some degree, become unified among the major DAW competitors. Having said 
this, there are enough variations in features, layout, and basic operation that 
individuáis—from aspiring beginner to seasoned professional—will have their 
favorite DAW make and model. With the growth of the DAW and Computer 
industries, people have begun to customize their computers with features, 
added power and peripherals that rival their love for souped-up cars and motor- 
cycles. In the end, though (as with many things in life), it doesn't matter which 
type of DAW you use—it's how you use it that counts! 

Sound file recording, editing, región definition and placement 

Most digital audio workstations are capable of recording sound files in mono, 
stereo, surround or multichannel formats (either as individual files or as a single 
interleaved file). These production environments graphically display sound file 
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FIGURE 7.25 

Pro Tools HD hard-disk recording software for the Mac or PC. (Courtesy of Digidesign, a división of Avid Technology, 
Inc., www.digidesign.com.) 


information within a main graphic window (Figure 7.26), which contains 
drawn waveforms that graphically represent the amplitude of a sound file over 
time in a WYSIWYG (what you see is what you get) fashion. Depending on the 
system type, sound file length and the degree of zoom, the entire waveform may 
be shown on the screen. Or, only a portion will show, but the waveform will 
continué to scroll ofif one or both sides of the screen. Graphic editing differs 
greatly from the "razor blade" approach that's used to cut analog tape in that 
the waveform gives us both visual and audible cues as to precisely where an 
edit point should be. Using this common display technique, any position, cut/ 
copy/paste, gain and time changes to the waveform will be instantly reflected 
on the screen. Usually, these edits are nondestructive (a process whereby the 
original file isn't altered—only the way in which the región in/out points are 
accessed or the file is processed as to gain, spectrum, etc.). 

Only when a waveform is zoomed-in fully is it possible to see the individual 
waveshapes of a sound file (Figure 7.27). At this zoom level, it becomes simple 
to lócate zero-crossing points (points where the level is at the 0, center-level 
line). In addition, when a sound file is zoomed-in to a level that shows indi¬ 
vidual sample points, the program might allow the sample points to be redrawn 
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in order to remove potential offenders (such as clicks and pops) or to smooth 
out amplitude transitions between loops or adjacent regions. 

The nondestructive edit capabilities of a DAW refer to a disk-based system's 
ability to edit a sound file without altering the data that was originally recorded 
to disk. This important capability means that any number of edits, alterations 
or program versions can be performed and saved to disk without altering the 
original sound file data. 
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FIGURE 7.26 

Main edit window 
within the Cubase 4 
audio production 
software. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


FIGURE 7.27 

Zoomed-in edit 
window showing 
individual samples. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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FIGURE 7.28 

Nondestructive 
editing allows a 
región within a larger 
sound file to begin at 
a specific point and 
play until the 
endpoint is reached. 


| begin playback |end playback 



Frankly, my dear... I don't give a damn! 







my dear...Frankly, I don't give a damn! 



I don't give a damn! my dear...Frankly, 



FIGURE 7.29 

Example of how 
snippets from Rhett’s 
famous Gone with 
the Wind dialogue 
can be easily 
rearranged. 


D I Y 

do it yourself 


Nondestructive editing is accomplished by 
accessing defined segments of a recorded digital 
audio file (often called regions) and allowing 
them to be reproduced in a user-defined order or 
segment length in a manner other than was orig- 
inally recorded. In effect, when a specific región 
is defined, we're telling the program to access the 
sound file at a point that begins at a specific 
memory address on the hard disk and continúes 
until the ending address has been reached (Figure 
7.28). Once defined, these regions can be inserted 
into the list (often called a playlist or editlist) in 
such a way that they can be accessed and repro¬ 
duced in any order. For example, Figure 7.29 
shows a snippet from Gone With the Wind that 
contains the immortal words "Frankly, my dear, 
I don't give a damn." By segmenting it into three 
regions we could request that the DAW editor 
output the words in several ways. 

When working in a graphic editing environment, 
regions can usually be defined by positioning the 


Tutorial: Recording a Sound File to Disk 


1. Download a demo copy of your favorite DAW (these 
are generally available off the company’s website for 
free). 

2. Download the workstation’s manual and familiarize 
yourself with ¡ts functional operating basics. 

3. Consult the manual regarding the recording of a 
sound file. 

4. Assign a track to an ¡nterface input sound source. 

5. Ñame the track! It’s almost always best to ñame the 
track (or tracks) before going into record. In this way 


the file will be saved to disk within the session folder 
under a descriptive ñame instead of an automatically 
generated file ñame (e.g., killerkick.wav instead of 
track16-01 .wav). 

6. Save the session and assign the input to another 
track, and overdub a track along with the previously 
recorded track. 

7. Repeat as necessary until you’re having fun! 

8. Save your final results for the next tutorial. 
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región to be col 









FIGURE 7.30 

Cutting inserís the highlighted región into memory and 
deletes the selected data. 


región to be copied 



FIGURE 7.31 

Copying simply places the highlighted región into memory 
without changing the selected waveform in any way. 


cursor over the waveform, pressing and holding 
the mouse or trackball button and then drag- 
ging the cursor to the left or right, which high- 
lights the selected región for easy identification. 
After the región has been defined, it can be edited, 
marked, named, maimed or otherwise processed. 

As one might expect, the basic cut-and-paste 
techniques used in hard-disk recording are 
entirely analogous to those used in a word 
processor or other graphics-based programs: 


¡nsert point 





II 



FIGURE 7.32 

Pasting copies the 
data within the 
system’s clipboard 
memory into the 
sound file at the 
current cursor 
position. 


■ Cut: Places the highlighted región into clip¬ 
board memory and deletes the selected 
data (Figure 7.30). 

■ Copy: Places the highlighted región into memory and doesn't alter the 
selected waveform in any way (Figure 7.31). 

■ Paste: Copies the waveform data that's within the system's clipboard 
memory into the sound file beginning at the current cursor position 
(Figure 7.32). 


ETI Y 

do it yourself 


1. Open the session from the preceding tutorial, “Record¬ 
ing a Sound File to Disk.” 

2. Consult your editor’s manual regarding basic cut-and- 
paste commands (which are almost always the 
standard PC and Mac commands). 

3. Open a sound file and define a región that ¡ncludes a 
musical phrase, lyric or sentence. 


Cut the región and try to paste it into another point in 
the sound file in a way that makes sense (musical or 
otherwise). 

5. Feel free to cut, copy and paste to your heart’s desire 
to create an interesting or totally wacky sound file. 


Tutorial: Copy and Paste 

4. 
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Besides basic nondestructive cut-and-paste editing techniques, the amplitude 
Processing of a signal is one of the most common types of changes that are 
likely to be encountered. These inelude such processes as gain changing, nor- 
malization and fading. 

Gain changing relates to the altering of a región or track's overall amplitude 
level, such that a signal can be proportionally increased or reduced to a specified 
level (often in dB or percentage valué). To increase a sound file or region's 
overall level, a function known as normalization can be used. Normalization 
(Figure 7.33) refers to an overall change in a sound file or defined region's signal 
level, whereby the file's greatest amplitude will be set to 100% full scale (or a 
set percentage level of full scale), with all other levels in the sound file or región 
being proportionately changed in gain level. 

The fading of a región (either in or out, as shown in Figure 7.34) is accom- 
plished by increasing or reducing a signal's relative amplitude over the course 
of a defined duration. For example, fading in a file proportionately increases a 
region's gain from infinity (zero) to full gain. Likewise, a fade-out has the oppo- 
site effect of creating a transition from full gain to infinity. These DSP functions 
have the advantage of creating a much smoother transition than would other- 
wise be humanly possible when performing a manual fade. 


FIGURE 7.33 

Original signal and 
normalized signal 
level. 



FIGURE 7.34 

Examples of fade-in 
and fade-out curves. 
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FIGURE 7.35 

Example of a 
cross-faded sound 
file. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


A cross-fade (or X-fade) is often used to smooth the transition between two 
audio segments that either are sonically dissimilar or don't match in amplitude 
at a particular edit point (a condition that would otherwise lead to an audible 
"click" or "pop"). This functional tool basically overlaps a fade-in and fade-out 
between the two waveforms to create a smooth transition from one segment to 
the next (Figure 7.35). Technically, this process averages the amplitude of the 
signáis over a user-definable length of time in order to mask the offending edit 
point. 

FIXING THE SOUND WITH A SONIC SCALPEL 

In traditional multitrack recording, should a mistake or bad take be recorded 
onto a new track, it's a simple matter to start over and re-record over the 
unwanted take. However, if only a small part of the take was bad, it's easy to go 
back and perform a punch-in (Figure 7.36). During this process, the recorder 
or DAW: 

■ Silently enters into record at a predetermined point 

■ Records over the unwanted portion of the take 

■ Silently falls back out of record at a predetermined point. 

A punch can be manually performed on most recording systems; however, 
newer tape machines and DAW systems can be programmed to automatically 
go into and fall out of record at a predetermined time. 
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FIGURE 7.36 

Punch-ins let you 
selectively replace 
material and correct 
mistakes. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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When punching-in, any number of variables can come 
into play. If the instrument ¡s an overdub, it’s often easy 
to punch the track without fear of any consequences. If 
an offending instrument is part of a group or ensemble, 
leakage from the original instrument could find its way 
into adjacent tracks, making a punch difficult or impos- 
sible. In such a situation, it’s usually best to re-record the 
piece, pick up at a point just before the bad section and 
splice or insert ¡t into the original recording or attempt to 
punch the section using the entire ensemble. 

From a continuity standpoint, it’s often best to 
punch-in on a section immediately after the take has 


been recorded, because changes in mic choice, mic 
position or the general “vibe” of the session can lead to a 
bad punch that’ll be hard to correct. If this isn’t possible, 
make sure that you carefully document the mic choice, 
placement, preamp type, etc., before moving onto the 
next section: You’ll be glad you did. 

Performing a “punch” should always be done with 
care. Allowing the track to continué recording too long a 
passage could possibly cut off a section of the following, 
acceptable track and likewise require that the following 
section be redone. Stopping ¡t short could cut off the 
natural reverb trail of the final note. 


It needs to be pointed out that performing a punch using a DAW is often "far" 
easier than doing the same on an analog recorder. For example: 

■ If the overdub wasn't that great, you can simply click to "undo" it and 
start over! 

■ If the overdub was started early and cut into the good take (or went too 
long), the leading and/or trailing edge of the punch can be manually 
adjusted to expose or hide sections after the punch has been performed 
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composite track 


FIGURE 7.37 

A single composite 
track can be created 
from several partially 
acceptable takes. 


These beneficial luxuries can go a long way toward reducing operator error—and 
its associated tensions—during a session. 

COMPING 

When performing a musically or technically complex overdub, most DAWs will 
let you comp (short for composite) múltiple overdubs together into a final, 
master take (Figure 7.37). Using this process, a DAW can be programmed to 
automatically enter into and out of record at the appropriate points. When 
placed into record mode, the DAW will start laying down the overdub into a 
new and sepárate track. At the end of the overdub, it'll loop back to the begin- 
ning and start recording the next pass onto a sepárate track. This process of 
laying down consecutive takes will continué, until the best take is done or the 
artist gets tired of recording. Once done, an entire overdub might be chosen, or 
individual segments from the various takes might be assembled together into 
a final, master overdub. Such is the life of a digital microsurgeon! 

MIDI sequencing and scoring 

Most DAWs inelude extensive support for MIDI (Figures 7.38 and 7.39), allow- 
ing electronic instruments, controllers, effeets devices, and electronic music 
software to be integrated with multitrack audio and video tracks. This important 
feature often ineludes the full implementation for: 
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FIGURE 7.38 

Steinberg Cubase/ 
Nuendo piano roll 
edit window. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 



FIGURE 7.39 

Steinberg Cubase/ 
Nuendo notation edit 
window. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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■ MIDI sequencing, processing and editing 

■ Score editing and printing 

■ Drum pattern editing 

■ MIDI signal processing 

■ Support for linking the timing and I/O elements of an external music 
application (ReWire) 

■ Support for software instruments (VSTi and RTAS). 

Further reading about the wonderful world of MIDI can be found in 
Chapter 9. 

SUPPORT FOR VIDEO AND PICTURE SYNC 

Most high-end DAWs also inelude support for displaying a video track within 
a session, both as a video window that can be displayed on the desktop and in 
the form of a video thumbnail track (which often appears as a linear guide 
track). Through the use of SMPTE timecode, MTC and wordclock, external video 
players and edit devices can be locked with the workstation's timing elements, 
allowing for full "mix to picture" capabilities (Figure 7.40). 

Real-time, on-screen mixing 

In addition to their ability to offer extensive región edit and definition, one of 
the most powerful cost- and time-effective features of a digital audio worksta¬ 
tion is its ability to offer on-screen mix capabilities (Figures 7.41 and 7.42), 
known as "mixing inside the box." Essentially, most DAWs inelude an extensive 
digital mixer interface that offers most (if not all) of the capabilities that are 



FIGURE 7.40 

Most high-end DAW 
systems are capable 
of importing a video 
file directly into the 
project session 
window. 
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FIGURE 7.41 

Nuendo on-screen 
mixer. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 



FIGURE 7.42 

ProTools M-Powered 
on-screen mixer with 
layout details. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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offered by larger analog and/or digital consoles—without the price tag. In 
addition to the basic input strip fader, pan, solo/mute and select Controls, most 
DAW software mixers offer broad support for EQ, effects plug-ins (offering a 
staggering amount of DSP flexibility), spatial positioning (pan and possibly 
surround-sound positioning), total automation (both mixer and plug-in auto- 
mation), external mix, function and transport control from a supported external 
hardware controller, support for exporting a mixdown to a file ... the list goes 
on and on and on. Further reading on the mixers, consoles and the process of 
mixing audio can be found in Chapter 13. 

DSP EFFECTS 

In addition to being able to cut, copy and paste regions within a sound file, it's 
also possible to alter a sound file, track or segment using digital signal Process¬ 
ing techniques. In short, DSP works by directly altering the samples of a sound 
file or defined región according to a program algorithm (a set of programmed 
instructions) in order to achieve a desired result. These processing functions can 
be performed either in real time or non-real time (offline): 

■ Real-time DSP: Commonly used in most modern-day DAW Systems, this 
process makes use of the computer's CPU or additional acceleration hard¬ 
ware to perform complex DSP calculations during actual playback. Because 
no calculations are written to disk in an offline fashion, significant savings 
in time and disk space can be realized when working with productions 
that involve complex or long processing events. In addition, the automa¬ 
tion instructions for real-time processing are embedded within the saved 
session file, allowing any effect or set of parameters to be changed, undone 
and redone without affecting the original sound file. 

■ Non-real-time DSP: Using this method, signal processing (such as changes 
in level, EQ, dynamics or reverb) that is too calculation intensive to be 
carried out during playback will be calculated (in an offline fashion). In 
this way, the newly calculated file (containing the effect, submix, etc.) will 
be played back, without having to use up the extra resources that are now 
available to the CPU for other functions. DAWs will often have a specific 
term for tracks or processing functions that have been written to disk, such 
as "locking" or "freezing" a file. When DSP is performed in non-real time, 
it's almost always wise to save both the original and the affected sound 
files, just in case you need to make changes at a later time. 

Most DAWs offer an extensive array of DSP options, ranging from options that 
are built into the basic I/O path of the input strip (e.g., basic EQ and gain-related 
functions) to DSP effects and plug-ins that come bundled with the DAW package, 
to third-party effects plug-ins that can be either inserted directly into the signal 
path (direct insertion) or offered as a master effect path that numerous tracks 
can be assigned to and/or mixed into (side chain). Although the way in which 
effects are implemented in a DAW will vary from one make and model to the 
next, the basic fundamentáis will be much the same. The following notes 
describe but a few of the possible effects that can be plugged into the signal 
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FIGURE 7.43 

4-Band EQII plug-in 
for Pro Tools. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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FIGURE 7.44 

EQ plug-in for 
Cubase/Nuendo. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 



path of DAW; however, further reading on effects processing can be found in 
Chapter 14 (Signal Processing). 

■ Equalization: EQ is, of course, a feature that's often implemented at the 
basic level of a virtual input strip (Figures 7.43 and 7.44). Most Systems 
give full parametric control over the entire audible range, offering overlap- 
ping control over several bands with a variable degree of bandwidth 
control (Q). Beyond the basic EQ options, many third-party EQ plug-ins 
are available on the market that vary in complexity, musicality and market 
appeal (Figures 7.45 and 7.46). 

■ Dynamic range: Dynamic range processors (Figures 7.47 and 7.48) can be 
used to change the signal level of a program. Processing algorithms are 
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FIGURE 7.45 

Cambridge equalizer 
and filters for the 
UAD effects Process¬ 
ing card. (Courtesy of 
Universal Audio, 
www.uaudio.com, 

© 2009 Universal 
Audio, Inc. All rights 
reserved. Used with 
permission.) 
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FIGURE 7.46 

Dynamic EQ for the 
Powercore DSP 
processor and 
plug-in system. 
(Courtesy of TC 
Electronic A/S, www. 
tcelectronic.com.) 


available that emulate a compressor (a device that reduces gain by a ratio 
that's proportionate to the input signal), limiter (reduces gain at a fixed 
ratio above a certain input threshold), or expander (increase the overall 
dynamic range of a program). These gain changers can be inserted directly 
into a track, used as a grouped master effect, or inserted into the final 
output path for use as a master gain processing block. 

In addition to the basic complement of dynamic range processors, wide assort- 
ments of multiband dynamic plug-in processors (Figure 7.49) are available for 
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FIGURE 7.47 

Precisión Limiter for the UAD effects Processing card. (Courtesy of Universal Audio, www.uaudio.com, © 2009 
Universal Audio, Inc. All rights reserved. Used with permission.) 


FIGURE 7.48 

Fabrik C for the 
Powercore DSP 
processor and 
plug-in system. 
(Courtesy of TC 
Electronic A/S, www. 
tcelectronic.com.) 
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general and mastering DSP applications. These processors allow the overall 
frequency range to be broken down into various frequency bands. For example, 
a plug-in such as this could be inserted into a DAW's main output path, which 
allows the lows to be compressed while the mids are lightly limited and the 
highs are de-essed to reduce sibilance. 

■ Delay: Another important effects category that can be used to alter and/or 
augment a signal revolves around delays and regeneration of sound over 
time. These time-based effects use delay (Figures 7.50 and 7.51) to add a 
perceived depth to a signal or change the way that we perceive the dimen¬ 
sional space of a recorded sound. A wide range of time-based plug-in 
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FIGURE 7.49 

MasterX5 48k 
multiband dynamics 
for the Powercore 
DSP processor and 
plug-in system. 
(Courtesy of TC 
Electronic A/S, www. 
tcelectronic.com.) 
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FIGURE 7.50 

DoubleDelay VST 
plug-in. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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FIGURE 7.51 

DM-1 Delay 
Modulator for the 
LIAD effects Process¬ 
ing card. (Courtesy 
of Universal Audio, 
www.uaudio.com, 

© 2009 Universal 
Audio, Inc. All rights 
reserved. Used with 
permission.) 



effects exist that are all based on the use of delay (and/or regenerated 
delay) to achieve such results as: 

■ Delay 

■ Chorus 

■ Flanging 

■ Reverb 

Further reading on the subject of delay (and the subject of signal processing in 
general) can be found in Chapter 14 (Signal Processing). 

■ Pitch and time change: Pitch change functions make it possible to shift the 
relative pitch of a defined región or track either up or down by a specific 
percentage ratio or musical interval. Most Systems can shift the pitch of a 
sound file or defined región by determining a ratio between the current 
and the desired pitch and then adding (lower pitch) or dropping (raise 
pitch) samples from the existing región or sound file. In addition to raising 
or lowering a sound file's relative pitch, most Systems can combine vari¬ 
able sample rate and pitch shift techniques to alter the duration of a región 
or track. These pitch- and time-shift combinations make it possible for 
such changes as: 

■ Pitch shift only: A program's pitch can be changed while recalculating 
the file so that its length remains the same. 

■ Change duration only: A program's length can be changed while shifting 
the pitch so that it matches that of the original program. 

■ Change in both pitch and duration: A program's pitch can be changed 
while also having a corresponding change in length. 

When combined with shifts in time (delay), changes in pitch make it possible 
for a multitude of effects to be created (such as flanging, which results from 
random fluctuations in delay and time shifts that are mixed with the original 
signal to create an ethereal "phasey" kind of sound). 

D5P PLUG-INS 

Workstations often offer a number of DSP effects that come bundled with the 
program; however, a staggering range of third-party plug-in effects can be 
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inserted into a signal path which perform functions for any number of tasks 
ranging from the straightforward to the wild-'n-zany. These effects can be pro- 
grammed to seamlessly intégrate into a host DAW application that conforms to 
such plug-in platforms as: 

■ DirectX: A DSP platform for the PC that offers plug-in support for sound, 
music, graphics (gaming) and network applications running under Micro¬ 
soft Windows (in its various OS incarnations) 

■ AU (Audio Units): Developed by Apple for audio and MIDI technologies 
in OS X; allows for a more advanced GUI and audio interface 

■ VST (Virtual Studio Technology): A native plug-in format created by Stein- 
berg for use on either a PC or Mac; all functions of a VST effect processor 
or instrument are directly controllable and automatable from the host 
program 

■ MAS (MOTU Audio System): A real-time native plug-in format for the Mac 
that was created by Mark of the Unicom as a proprietary plug-in format 
for Digital Performer; MAS plug-ins are fully automatable and do not 
require external DSP in order to work with the host program 

■ AudioSuite: A file-based plug-in that destructively applies an effect to a 
defined segment or entire sound file, meaning that a new, affected versión 
of the file is rewritten in order to conserve on the processor's DSP overhead 
(when applying AudioSuite, it's often wise to apply effects to a copy of 
the original file so as to allow for future changes) 

■ RTAS (Real-Time Audio Suite): A fully automatable plug-in format that 
was designed for various flavors of Digidesign's Pro Tools and mns 
on the power of the host CPU (host-based processing) on either the 
Mac or PC. 

■ TDM (Time Domain Multiplex): A plug-in format that can only be used 
with Digidesign Pro Tools Systems (Mac or PC) that are fitted with Digide- 
sign Farm cards; this 24-bit, 256-channel path integrates mixing and real- 
time digital signal processing into the system with zero latency and under 
full automation 

These popular software applications (which are being programmed by major 
manufacturers and third-party startups alike) have helped to shape the face of 
hard-disk recording by allowing us to pick and choose the plug-ins that best fit 
our personal production needs. As a result, new companies, ideas and task- 
oriented producís are constantly popping up on the market, literally on a 
monthly basis. 

REWIRE 

ReWire and ReWire2 are special protocols that were developed by Propellerhead 
Software and Steinberg to allow audio to be streamed between two simultane- 
ously mnning Computer applications. Unlike a plug-in, where a task-specific 
application is inserted into a compatible host program, ReWire allows the audio 
and timing elements of an independent client program to be seamlessly inte- 
grated into another host program. In essence, ReWire provides virtual patch 
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ReWire "Client" 


ReWire "host" 



FIGURE 7.52 

ReWire allows a Client program to be inserted into a host program (often a DAW) so the programs can run simultane- 
ously in tándem. 


chords (Figure 7.52) that link the two programs together within the Computer. 
A few of ReWire's supporting features inelude: 

■ Real-time streaming of up to 64 sepárate audio channels (256 with 
ReWire2) at full bandwidth from one program into its host program 
application 

■ Automatic sample accurate synchronization between the audio in the two 
programs 

■ An ability to allow the two programs to share a single soundeard or inter¬ 
face 

■ Linked transport Controls that can be controlled from either program 
(provided it has some kind of transport functionality) 

■ An ability to allow numerous MIDI outs to be routed from the host 
program to the linked application (when using ReWire2) 

■ A reduction of the total number of system requirements that would be 
required if the programs were run independently. 

This useful protocol essentially allows a compatible program to be plugged into 
a host program in a tándem fashion. As an example, ReWire could allow Propel- 
lerhead's Reason (client) to be "ReWired" into Steinberg's Cubase DAW (host), 
allowing all MIDI functions to pass through Cubase into Reason while patching 
the audio outs of Reason into Cubases virtual mixer inputs. For further infor- 
mation on this useful protocol, consult the supporting program manuals. 

ACCELERATOR CARD5 

In most circumstances, the CPU of a host DAW program will have sufficient 
power and speed to perform all of the DSP effeets and processing needs of a 
project. Under extreme production conditions, however, the CPU might run out 
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FIGURE 7.53 

The UAD-2 DSP PCIe DSP processor and several plug-in examples. (Courtesy of Universal Audio, www.uaudio.com, 
© 2009 Universal Audio, Inc. All rights reserved. Used with permission.) 


of computing steam and choke during real-time playback. LJnder these condi- 
tions, there are a couple of ways to reduce the workload on a CPU: On one 
hand, the tracks could be "frozen," meaning that the processing functions 
would be calculated in non-real time and then written to disk as a sepárate file. 
On the other hand, an accelerator card (Figures 7.53 and 7.54) that's capable 
of adding extra CPU power can be added to the system, giving the system extra 
Processing power to perform the necessary effects calculations. Note that in 
order for the plug-ins to take advantage of the acceleration they need to be 
specially coded for that specific DSP card. 

Mixdown and effects automation 

One of the great strengths of the "in the box" age is how easily all of the mix 
and effects parameters can be automated and recalled within a mix. The ability 
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FIGURE 7.54 

Powercore FireWire PCIe DSP processor and several plug-in examples. (Courtesy of TC Electronic A/S, www. 
tcelectronic.com.) 


to change levels, pan and virtually control any parameter within a project makes 
it possible for a session to be written to disk, saved and recalled at a second's 
notice. In addition to grabbing a control and moving it (either virtually on- 
screen or from a physical controller), another interface style for controlling 
automation parameters (known as rubberband Controls) lets you view, draw and 
edit variables as a graphic line that details the various automation moves over 
time. As with any automation moves, these rubberband settings can be undone, 
redone or recalled back to a specific point in the edit stage. Often (but not 
always), the moves within a mix can't be "undone" and reverted back to any 
specific point in the mix. In any case, one of the best ways to save (and revert 
to) a particular mix versión (or various altérnate mix versions) is simply to 
save a specific mix under a unique, descriptive session file title (e.g., 
colabs_oceanis_radiomix01.ses) and then keep on working. By the way, it's 
always wise to save your mixes on a regular basis (many a great mix has been 
lost in a crash because it wasn't saved); also, progressively saving your mixes 
under various ñame or versión numbers (... Ol.ses ... 02.ses, etc.) can come in 
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handy if you need to revert to a past mix versión.... In short, save often and 
save regularly! 

EXPORTING A MIXDOWN TO FILE 

Once your mix is ready, most DAWs Systems are able to export (print) part or 
all of a session to a single file or set of sound files (Figure 7.55). An entire 
session or defined región can be exported as a single interleaved file (containing 
múltiple channels that are encoded into a single L-R-L-R sound file), or can be 
saved as sepárate, individual (L.wav and R.wav) sound files. Of course, a sur- 
round or multichannel mix can be likewise exported as a single interleaved file, 
or as sepárate files. 

Often, the session can be exported in non-real time (a faster-than-real-time 
process that can inelude all mix, plug-in effeets, automation and virtual instru- 
ment calculations) or in real time (a process that's capable of sending and 
receiving real-time analog signáis through the audio interface so as to allow for 
the insertion of external effeets devices, etc.). Usually, a session can be mixed 
down in a number of final sound file and bit/sample rate formats. Most DAWs 
will allow third-party plug-ins to be inserted into the final (master) output 
section, allowing for the export of a session into a specific output file format. 
(For example, a discrete surround mix can be folded down into a two-channel 
Dolby Pro Logic surround-sound file, or the same file can be rendered as a Dolby 
Digital 5.1 or DTS file for insertion into a DVD video soundtrack.) 
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FIGURE 7.55 

Most DAWs are 
capable of exporting 
session sound 
files, effeets and 
automation to a final 
mixdown track. 
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POWER TO THE PROCESSOR... UHHH, PEOPLE! 

Speaking of having enough power and speed to get the job done, there are 
definitely some tips and tricks that can help you get the most out of your digital 
audio workstation. Let's take a look at some of the more important Ítems. It's 
important to keep in mind that keeping up with technology can have its tri- 
umphs and its pitfalls. No matter which platform you choose to work with, 
there's no substitute for reading, research, and talking with your peers about 
your techno needs. It's generally best to strike a balance between our needs, our 
desires, the current State of technology and the relentless push of marketing to 
grab your money. And it's usually best to take a few big breaths before making 
any important decisions. 

Get a Computer that’s powerf ul enough 

With the increased demand for higher bit/sample rate resolution, more tracks, 
more plug-ins, more of everything, you'll obviously want to make sure that your 
Computer is fast and powerful enough to get the job done in real time without 
spitting and sputtering digits. This often means getting the most up-to-date and 
powerful computer/processor system that your budget can reasonably muster. 
With the advent of 64-bit OS platforms and dual-core processors (chips that 
effectively contain two CPUs), you'll want to make sure that your hardware will 
support these features before taking the upgrade plunge. The same goes for your 
production software and driver availability. If any part of this hardware, soft¬ 
ware and driver equation is missing, the system will not be able to make use 
of these advances. Therefore, one of the smartest things you can do is research 
the type and system requirements that would be needed to opérate your produc¬ 
tion system—and then make sure that your system exceeds these figures by a 
comfortable margin so as to make allowances for future technological advances 
and the additional processing requirements that are associated with them. If 
you have the budget to add some of the extra bells and whistles that go with 
living on the cutting edge, you should take the time to research whether or not 
your system will actually be able to deliver the extra goods when these advances 
actually hit the streets. 

Make sure you have enough memory 

It almost goes without saying that your system will need to have an adequate 
amount of random access memory (RAM) and hard-disk storage in order for 
you to take full advantage of your processor's potential and your system's data 
storage requirements. RAM is used as a temporary storage area for data that is 
being processed and passed to and from the computer's central processing unit 
(CPU). Just as there's a "need for speed" within the computer's CPU, the general 
guidelines for RAM ask that we install memory with the fastest transfer speed 
that can be supported by the Computer. It's important that you install as much 
memory as your Computer and budget will allow. Installing too little RAM will 
forcé the OS to write this temporary data to and from the hard disk—a process 
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that's much slower than transfer to RAM and causes the system's overall perfor¬ 
mance to slow to a crawl. For those who are making extensive use of virtual 
sampling technology (whereby samples are transferred to RAM), it's usually a 
wise idea to throw as much RAM as is possible into the system. It should be 
noted that although PC computers and Windows XP allow for up to 4-GB RAM 
installs, making use of more than 2 GB can actually be tricky or may not be 
accessible at all. Reading and research should be undertaken when attempting 
to add large amounts of RAM to a system. 

Hard-disk requirements for a system are certainly an important consideration. 
The general considerations inelude: 

■ Need for size: Obviously, you'll want to have drives that are large enough 
to meet your production storage needs. With the use of numerous tracks 
within a session, often at sample rates of 24 bit/9 6k, data storage can 
quickly become an important consideration. 

■ Need for speed: With the current track count and sample rate requirements 
that can commonly be encountered in a DAW session, it's easy to under- 
stand how lower disk access times (the time that's required for the drive 
heads to move from one place to another on a disk and then output that 
data) becomes important. 

It's interesting to note that, in general, the disk drives with larger storage capac- 
ities will often have shorter access times and a higher data transfer rate. These 
large-capacity drives will often spin at speeds that are at least 7200 rpm, will 
have access times that are 10 ms or faster and are able to sustain a transfer rate 
of 8 to 10 MB/sec or faster. 

Keep your production media sepárate 

Whenever possible, it's important to keep your program and operating system 
data on a sepárate drive from the one that holds your production data. This is 
due to the simple fact that a Computer periodically has to check in and interact 
with both the currently running program and the OS. Should the production 
media be on the same disk, periodic interruptions in audio data will occur as 
the disk takes a moment out to go perform a program-related task (and vice 
versa), resulting in a reduction in media and program data access time, as well 
as throughput. For these reasons, it's also wise to keep the program and media 
drives off of the same data cable and onto sepárate data paths. 

Update your drivers! 

In this day and age of software revisions, it's always a good idea to go on the 
Web and search for the latest update to a piece of software or a hardware driver. 
Even if you've just bought a product new out of the box, it might easily have 
been sitting on a music store shelf for over a year. By going to the company 
website and downloading the latest versions, you'll be assured that it has the 
latest and greatest capabilities. In addition, it's always wise to save these updates 
to disk in your backup directories. This way, if you're outstanding in your field 
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and there's a hardware or software problem, you'll be able to reload the software 
or drivers and "should" be on your way in no time. 

5 Going dual monitor 

Q: How do you fit the easy visual reference of múltiple programs, documents 
and a digital audio workstation onto a single video monitor? 

A: You often don't. Those of you who rely on your Computer for recording 
and mixing, surfin', writing, etc., should definitely think about doubling 
your Computer's visual real estáte by adding an extra monitor to your 
Computer system. 

Folks who have never seen or thought much about adding a second monitor 
might be skeptical and ask, "What's the big deal?" But, all you have to do is sit 
down and start opening programs to see just how fast your screen can get filled 
up. When using a complicated production program (such as a DAW or a high- 
end graphics app), getting the job done with a single monitor can be an exercise 
in total frustration. There's just too much we need to see and not enough screen 
real estáte to show it on. The ability to quickly make a change on a virtual mixer, 
edit window or software plug-in has become more of a necessity than a luxury. 

Truth is, in this age of the Mac and Windows, adding an extra monitor is a fairly 
straightforward proposition. You could either spring for a dual-head graphics 
card that can support two monitors or you can visit your favorite Computer store 
and buy a second video card. Getting hold of a second monitor could be as 
simple as grabbing an unused one from the attic, buying a standard 17- or 
19-inch CRT model, or springing for a new LCD monitor. 

Once you've installed the hardware, the software side of building a dual- 
monitor system is relatively straightforward. For example, in Windows XP, 
right-clicking on the main desktop and selecting Properties/Settings will cali up 
a window that lets you alter the resolution settings for each monitor. Checking 
the "Extend my Windows desktop into this monitor" will extend your desktop 
across both monitors—and you should be well on your way. 

Those of you who use a laptop can also enjoy many of these benefits by plug- 
ging the second monitor into the video out and following the setup steps that 
are recommended by your computer's operating system. You should be aware 
that many laptops are limited in the way they share video memory and might 
be restricted in the resolution levels that can be selected. 

This might not seem much like a recording tip, but once you get a dual-moni¬ 
tor system going your whole approach to producing contení (of any type) on 
a Computer will instantly change and you'll quickly wonder how you ever got 
along without it! 

6 Keeping your Computer quiet 

Noise! Noise! Noise! It's everywhere! It's in the streets, in the car, and even in 
our studios. It seems like we spend all those bucks getting the best sound pos- 
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sible, only to gunk it all up by placing this big Computer box that's full of noisy 
fans and whirring hard drives smack in the middle of a critical listening area. 
Fortunately, a number of companies have begun to find ways to reduce the 
problem. Here are a few Solutions: 

■ Whenever possible, use larger, low-rpm fans to reduce noise. 

■ Certain PC motherboards come bundled with a fan speed utility 
that can monitor the CPU and case heat and adjust the fan speeds 
accordingly. 

■ Route your internal case cables carefully. They could block the flow of air, 
which can add to heat and noise problems. 

■ A growing number of hard-disk drives are available as quiet drives. Check 
the manufacturéis noise ratings. 

■ You might consider placing the Computer in a well-ventilated area, just 
outside the production room. Always pay special attention to ventilation 
(both inside and outside the Computer box), because heat is a killer that'll 
reduce the life span of your Computer. ( Note: When building my own 
studio I designed a special alcove/glass door enclosure that houses my 
main Computer—no muss, no fuss, and very little noise.) 

■ Thanks to gamers and audio-aware buyers, a number of companies exist 
that specialize in quiet Computer cases, fans and components. These are 
always fun to check out. 

Backup and archive strategies 

It's pretty much always true that it's not a matter of if a hard drive will fail, but 
when. It's not a matter of if something will happen to an irreplaceable hard 
disk, but when. When we least expect it, disaster will strike. It's our job to be 
prepared for the inevitable. This type of headache can of course be partially or 
completely averted by backing up your active program and media files, as well 
as by archiving your previously created sessions and then making sure that these 
files are also backed up. 

As previously stated, it's generally wise to keep your computéis operating 
system and program data on a sepárate hard disk (usually the boot drive) and 
then store your session files on a sepárate media drive. Let's take this as a prac- 
tical and important starting point. Beyond this premise, as most of you are quite 
aware, the basic rules of hard-disk management are extremely personal, and will 
often differ from one Computer user to the next (Figure 7.56). Given these dif- 
ferences, I'd still like to offer up some basic guidelines: 

■ It's important to keep your data (of all types) well organized, using a 
system that's both logical and easy to follow. For example, online updates 
of a program or hardware driver downloads can be placed into their own 
directories; data relating to your studio can be placed in the "studio" direc- 
tory and subdirectories; documents, MP3s, and all the trappings of day- 
to-day studio operations can be also placed on the disk, using a system 
that's easy to understand. 
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FIGURE 7.56 

Data and hard-drive 
management (along 
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production. 
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■ Session data should likewise be logical and easy to find. Each project 
should reside in its own directory on a sepárate media drive, and each 
song should likewise reside in its own subdirectory of the session project 
directory. 

■ Remember to save various take versions of a mix. If you just added the 
vocals to a song, go ahead and save the session under a new versión ñame. 
This acts as an "undo" function that lets you go back to a specific point 
in a session. The same goes for mixdown versions. If someone likes a 
particular mix versión or effect, go ahead and save the mix under a new 
ñame or versión number (my greatest song 1 verl5.ses) or (my greatest 
song 1 ver 15 favo rite effect.ses). In fact, it's generally wise to save versions 
throughout the mix. The session files are usually small and might save 
your butt at a later point in time. 

With regard to backup schemes, a number of options also exist. Although 
making backups to optical media (CD and DVD) is a common occurrence, it's 
been found that the most robust and long-lived backups are those that reside 
on a hard drive. There's also a wise saying: "Media isn't truly backed up unless 
it's copied in three different places." All I can do is lay out a general backup 
formula that has worked extremely well for me over the years: 

■ CD/DVD: I start off with the general premise that CD and DVD media is 
temporary media that's to be used for getting media data into the hands 
of clients, friends, etc. 

■ Boot drive: The boot disk is used to store the operating system data, 
program data, program- and driver-related downloads, documents, graph- 
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ics and non-production-related media (such as MP3 music). It can also 
be used as a "local" archive drive for storing past sessions—assuming that 
you have this data adequately backed up elsewhere. 

■ Second internal or external drive: Current media data resides on this drive. 
It can also be used as an archive drive for storing all of your past sessions 
along with your current sessions. 

■ External backup drive 1: If the disk is large enough, it can be used as a 
backup drive for both your boot and second media drive. If it isn't large 
enough, you'll want to back up to múltiple drives. 

■ External backup drive 2: If the disk is large enough, it can be used as a 
second backup drive for both your boot and media data. You might 
strongly consider storing this drive offsite. In the case of a theft, fire 
or other unforeseen disaster, you'll still be able to recover your precious 
data. 

All of the above are simply suggestions. I rarely give opinions like these in a 
book; however, they've served me so well and for so long that I had to pass 
them along. 

8 Session documentation 

Most of us don't like to deal with housekeeping. But when it comes to record- 
ing and producing a project, documenting the Creative process can save your 
butt after the session dust has settled—and help make your postproduction life 
much easier (besides, you never know when something will be reissued/ 
remixed). So let's discuss how to document the details that crop up befo re, 
during and after the session. After all, the project you save might be your own! 

DOCUMENTING WITHIN THE DAW 

One of the simplest ways to document and improve a session's workflow is to 
ñame a track before you press the record button, because most DAWs will use 
that as a basis for the file ñame. For example, by naming a track "Jenny lead 
voc take 5," most DAWs will automatically save and place the newly recorded 
file into the session as "Jenny lead voc take 5.wav" (or .aif). Locating this track 
later would be a lot easier than rummaging through sound files only to find 
that the one that you want is "AudioO 18-05." Because some DAW track displays 
are limited to about 8 characters, consider putting the easily identifiable text 
first (i.e., "leadvoc-jenny take5," which might display as leadvoc-j). 

Also, make use of your DAW's notepad (Figure 7.57). Most programs offer a 
scratchpad function that lets you fill in information relating to a track or project; 
use this to ñame a specific synth patch, note the mic used on a vocal, and 
inelude other info that might come in handy after the session's specifics have 
been long forgotten. 

Markers and marker tracks can also come in super-handy. These tracks can alert 
us to mix, tempo and other kinds of changes that might be useful in the pro- 
duction process. I'll often place the lyrics into a marker track, so I can sing the 
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FIGURE 7.57 

Cubase/Nuendo 
Notepad apps. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 
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track myself without the need for a lead sheet, or to help indícate phrasings to 
another singer. 

MAKE A DOCUMENTATION DIRECTORY 

The next step toward keeping better track of details is to create a "MySong 
Documents" directory within the song's session, and fill that folder with docu- 
ments and files that relate to the session such as: 

■ Your contact info 

■ Song title and basic production notes (composer, lyricist, label, business 
and legal contacts) 

■ Producer, engineer, assistant, mastering engineer, duplication facility, etc. 
(with contact info) 

■ Original and altered tempos, tempo changes, song key, timecode settings, 
etc. 

■ Original lyrics, along with any changes (changed by whom, etc?) 

■ Additional production notes 

■ Artist and supporting cast notes (including their roles, musician costs, 
address info, etc.) 

■ Lists of any software versions and plug-in types, as well as any pertinent 
settings (you never know if they'll be available at a future time, and a 
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description and screenshot might help you to duplícate it within another 

app) 

■ Lists of budget notes and production dates (billing hours, studio rates and 
studio addresses ... anything that can help you write off the $$$) 

■ Scans of copyright forms, session contracts, studio contracts and billings 

■ Anything else that's even remotely important. 

In addition, I'll often take screenshots of some of my more complicated plug-in 
settings and place these into this "time capsule" folder. If I have to redo the 
track later for some reason, I refer to the JPG screenshot and start reconstruction. 
Photos or movie clips can also be helpful in documenting which type of mic, 
instrument and specific placements were used within a setup. You can even use 
pictures to document outboard hardware settings and patch arrangements. 
Composers can use the "Doc" folder to hold original scratchpad recordings that 
were captured on your PDA, cell phone, or message machine (I do this for 
copyright purposes). 

Furthermore, a "MySong Graphics" directory can hold the elements, pictures 
and layouts that relate to the project's artwork ... a "MySong Business" directory 
might also come in handy. 

DAW GUIDELINES 

The Producers and Engineers Wing of NARAS (the Grammy folks) are nailing 
down a wide range of guidelines that can help with aspects of documentation, 
session transfers, backups and other techno issues. At present, the P&E is offer- 
ing general DAW guidelines for Pro Tools; although a non-platform-specific 
versión is in the works, the information's still general enough for everyone. It's 
well worth downloading a copy (as well as the material on surround and 
mastering) from http://www.grammy.com/recording_academy/producers_and_ 
engineers/guidelines. 

IN CLOSING 

At this time, Ed like to refer you to the many helpful pointers in Chapter 20. 
Em doing this in the hope that you'll read this section twice (at least)— 
particularly the discussion on project preparation, session documentation and 
backup/archive strategies. I promise that the time will eventually come when 
you'll be glad you did. 
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CHAPTER 8 


Groove Tools and 
Techniques 


The expression "getting into the groove" of a piece of music often refers to a 
feeling that's derived from the underlying foundation of music: rhythm. With 
the introduction and maturation of MIDI and digital audio, new and wondrous 
tools have made their way into the mainstream of music production. These tools 
can help us to use technology to forge, fold, mutilate and create compositions 
that make direct use of rhythm and other building blocks of music through the 
use of looping technology. Of course, the cyclic nature of loops can be repeat- 
repeat-repetitive in nature, but new toys and techniques for looping can inject 
added flexibility, control and real-time processing into a project in ways that 
can be used by artists in wondrously expressive ways. 

In this chapter we'll be touching on many of the approaches and software pack- 
ages that have evolved (and continué to evolve) into what is one of the fastest 
and most accessible facets of personal music production. It's literally impossible 
to hit on the finer operational points of all of these Systems; for that, I'll rely 
on your motivation and drive to: 

■ Download many of the software demos that are readily available. 

■ Delve into their manuals and working tutorials. 

■ Begin to create your own grooves and songs that can then be integrated 
with your own music or those of collaborators. 

If you do these three things, you'll be shocked and astounded as to how much 
you've learned. And these experiences can directly transíate into skills that'll 
widen your production horizons and possibly change your music. 

THE BASICS 

Because groove-based tools often deal with rhythms and cyclic-based loops, 
there are a couple of factors that need to be managed: 

■ Sync 

■ Tempo and length. 

The aspect of sync relates to the fact that the various loops in a groove project 
will need to sync up with each other (or in múltiple lengths and timings of each 
other). It almost goes without saying that múltiple loops that are successively 
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or simultaneously triggered must have a synchronous timing relationship with 
one another—otherwise, it's a jumbled mess of sound. 

The next relationship relates to the aspect of tempo. Just as sync is imperative, 
it's also necessary for the files to be adjusted in pitch (resampling) and/or length 
(time stretching), so that they precisely match the currently selected tempo (or 
are programmed to be a relative múltiple of the session's tempo). 

Time and pitch change techniques 

The process of altering a sound file to match the current session tempo and to 
synchronously align them occurs in the software by combining variable sample 
rates with pitch-shifting techniques. Using these basic digital signal processing 
(DSP) tools, it's possible to alter a sound file's duration (varying the length of 
a program by raising or lowering its playback sample rate) or to alter its relative 
pitch (either up or down). In this way, three possible combinations of time and 
pitch change can occur: 

■ Time change: A program's length can be altered without affecting its pitch. 

■ Pitch change: A program's length can remain the same while pitch is shifted 
either up or down. 

■ Both: Both a program's pitch and length can be altered using resampling 
techniques. 

Such loop-based programs and plug-ins involve the use of recorded sound files 
that are encoded with headers that inelude information on their original tempo 
and length (in both samples and beats). By setting the loop program to a master 
tempo (or a specific tempo at that point in the song), an audio segment or file 
can be imported, examined as to sample rate or length and then recalculated 
to a new pitch and relative tempo that matches the current session tempo. Voilá! 
We now have a defined segment of audio that matches the tempo of all of the 
other segments, allowing it to play and interact in relative sync with the other 
defined segments and/or loop files. 

Warping 

Newer digital audio workstation (DAW) and loop production tools are capable 
of altering the playback speed of a sound file or segment over the course of its 
duration to match the tempo of a loop to that of the session tempo or to another 
sound file. This process, called warping , uses various time-shift techniques to 
match the timing elements of a sound file by entering hitpoints into the sound 
file. These points can be automatically detected or manually placed at percussive 
transient points or at metric time divisions in the song. Once a hitpoint has 
been entered, it can be moved in time to a place that matches the timing divi¬ 
sión marks of another track. By moving this defined point, the software is able 
to speed or slow the file's playback as it moves from one hitpoint to the next 
so as to smooth out the relative timing differences, allowing the involved tracks 
to play back in relative sync with each other. 
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Certain software DAWs are capable of addressing timing and sync in different 
ways. Instead of varying the playback speed of various sound files so that they 
match the constant tempo of a session, changes in the tempo of a session can 
be varied to match the timing changes that exist in a recorded track. By placing 
tempo changes at their appropriate places, the beats per minute (bpm) will be 
shifted to match timing shifts in the recording. This means that the click will 
change to reflect the varying recorded tempo—meaning that the MIDI tracks, 
effects and instrument timings will be automatically adjusted to the varying 
tempo over the course of a song. 

Beat slicing 

Most loop-based production tools use a variety of time stretching and pitch- 
shifting algorithms to provide for intelligent beat matching in order to alter the 
length of a loop or defined segment so it matches the timing elements (gener- 
ally denoted in beats per minute) of the other loops within the session. Another 
method, called beat slicing, makes use of an entirely different process to match 
the length and timings of a sound file segment to the session tempo. Rather 
than changing the speed and pitch of a sound file, the beat slicing process actu- 
ally breaks an audio file into a number of small segments (not surprisingly 
called slices) and then changes the length and timing elements of a segment by 
adding or subtracting time between these slices. This procedure has the advan- 
tage of preserving the pitch, timbre and sound quality of a file but has the 
potential downside of creating silent breaks in the sound file. From this, it's easy 
to understand why the beat-slicing process often works best on percussive 
sounds that have silence between the individual hitpoints. This process is 
usually carried out by detecting the transient events within the loop and then 
automatically placing the slices at their appropriate points (often according to 
user-definable sensitivity and detection Controls). 

As with so many other things we've discussed, the sky (and your imagination) 
is the limit when it comes to the tricks and techniques that can be used to match 
the tempos and various time-/pitch-shift elements between loops, MIDI and 
sound file segments. I strongly urge you to take the time to read the manuals 
of various loop and DAW software packages to learn more about the actual 
terms and procedures and then put them into practice. If you take the time, I 
guarantee that your production skills and your outlook on these tools will be 
greatly expanded. 

For the remainder of this chapter, we'll be looking at several of the more popular 
groove tools and toys. This is by no means a complete listing, and I recommend 
that you keep reading the trade magazines, websites, and other resources, 
because new and exciting technologies come onto the market regularly. 

LOOPING YOUR DAW 

Most digital audio workstations offer various features that can make it possible 
to incorpórate looping into a session, along with other track-based sound, MIDI 
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and video files. Even when features that could make a specific task much easier 
aren't available, it's often possible to think through the situation and find 
workarounds that can help tackle the problem at hand. For example, a number 
of workstations allow the beginning point of an edited segment to be manually 
placed at a specific point. And, by simply clicking on and dragging the tail 
end of the segment, the sound file can be time-stretched into the session's 
proper time relationship (while the pitch remains unchanged). Said another 
way, if a session has been set to a tempo of 94 bpm, an 88-bpm loop can be 
imported at a specific measure. Then, by turning on the DAW's snap-to-grid and 
automatic time-stretch functions, the segment can be time-stretched until it 
snuggly fits into the session's native tempo (Figure 8.1). Now that the loop fits 
the current tempo, it can be manually looped (i.e., copied) to your heart's 
content. 

As was mentioned, different DAWs and editing systems will have differing 
ways of tackling a situation and with varying degrees of ease. One of the 
best ways to avoid pitfalls is to set your session tempo (or varying tempo map) 
and click track at the beginning of a session and then manually adjust your 
loop timings to fit that tempo. Just remember, there are usually no hard 
and fast rules. With planning, ingenuity and your manual's help, you'll be 
surprised at the number of ways that a looping problem can be turned into an 
opportunity. 


FIGURE 8.1 

Most production 
workstations allow a 
loop or sound file to 
be stretched to 
match the session’s 
tempo. 



loop stretched 
to match tempo 


o o nn 

O EJCiZfU 
C3 C51 Cüü 
nri tHKI 

o o on 

Q DK 2 E 3 

LJ k3 t¡¡3 

■ ■« i tr>ord 

O O □□ 
Q OC 3 Q 

n sn is 


o o on 
Qsrza 

n ¡r=i E 3 


Al AJI 

Jrin 


•> 

jSjSJagJ 


I H | l 

ll£l¿ 




loop doesfVt maten 
session tempo 








































Groove Tools and Techniques A APTE R 8 


B I Y 

do it yourself 


Tutorial: Manual Looping uiith a DAW 


1. Go to www.acidplanet.com and download an ACID 
8pack set of loops. 

2. Load the individual loops into your favorite DAW. 

3. Set the time display to read in bars and beats (tempo). 

4. Import the loops into the session. 

5. Try to match the session tempo to a bpm setting that 
seems to match the 8pack’s intended tempo. 


6. It’s possible (but not likely) that the loop lengths won’t 
match up. If this happens, consult your DAW’s manual 
in order to manually time-stretch the loops to fit the 
desired session tempo. 

7. Copy and duplícate the various loop tracks, until 
you’ve made a really great song! 

8. Save the session to play for your friends! 


Loop-based audio software 

Loop-based audio editors are groove-driven music programs (Figures 8.2 and 
8.3) that are designed to letyou drag and drop prerecorded or user-created loops 
and audio tracks into a graphic multitrack production interface. At their basic 
level, these programs differ conceptually from their traditional DAW counter- 
part in that the pitch- and time-shift architecture is so variable and dynamic 
that, even after the basic rhythmic, percussive and melodic grooves have been 
created, their tempo, track patterns, pitch, session key, etc., can be quickly and 
easily changed at any time. With the help of custom, royalty-free loops (avail- 
able from many manufacturer and third-party companies), users can quickly 
and easily experiment with setting up grooves, backing tracks and creating a 
sonic ambience by simply dragging the loops into the program's main sound 
file view, where they can be arranged, edited, processed, saved and exported. 

One of the most interesting aspects of the loop-based editor is its ability to 
match the tempo of a specially programmed loop to the tempo of the current 
session. Amazingly enough, this process isn't that difficult to perform, because 
the program extracts the length, native tempo, and pitch information from the 
imported file's header and, using various digital time- and pitch-change tech- 
niques, adjusts the loop to fit the native time and pitch parameters of the current 
session. This means that loops of various tempos and musical keys can be 
automatically adjusted in length and pitch so as to fit in time with previously 
existing loops ... just drag, drop and go! 

Behind the scenes, these shifts in time can be performed in a number of ways. 
For example, using basic DSP techniques to time-stretch and pitch-shift, a 
recorded loop will often work well over a given plus-or-minus percentage range 
(which is often dependent on the quality of the program algorithms). Beyond 
this range, the loop will often begin to distort and become jittery. At such 
extremes, other playback algorithms and beat slice detection techniques can be 
used to make the loop sound more natural. For example, drums or percussion 
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FIGURE 8.2 

Apple GarageBand™. (Courtesy of Apple Computers, Inc., www.apple.com.) 


can be stretched in time by adding additional silence between the various hit- 
points within the loop (slicing), at precisely calculated intervals. In this way, 
the pitch will remain the same while the length is altered. Of course, such a 
loop would sound choppy and broken up when played on its own; however, 
when buried within a mix, it might work just fine. It's all up to you and the 
current musical context. 

0 I Y 

do it yourself 


Tutorial: Having Fun uiith a Loop-Based Editor 


1. Go to www.acidplanet.com and download their free 
versión of ACID Xpress. 

2. Download an ACID 8pack set of loops. 

3. Load the individual loops into ACID Xpress (or a 
loop-based editor that can read ACID files). 

4. Read the program’s manual and begin to experiment 
with the loops. 


5. Mess around with the tempo and musical keys 

6. Copy and duplícate the various loop tracks to 
your heart’s contení until you’ve made a really 
great song! 

7. Save the session to play for your friends! 
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FIGURE 8.3 

ACID Pro professional music workstation. (Courtesy of Sony Media Software, a división of Sony Corporation of America, 
www.sonycreativesoftware.com.) 


Of course, the graphic user interfaces (GUIs) between looping software editors 
and tools can differ greatly. Most layouts use a track-based system that lets you 
enter or drag a preprogrammed loop file into a track and then drag it to the 
right in a way that repeats the loop in a traditional way. Again, it's worth stress- 
ing that DAW editors will often inelude such looping functions that can be basic 
(requiring manual editing or sound file processing) or advanced in nature 
(having any number of automated loop functions). 

Other loop programs make use of visual objeets that can be triggered and com- 
bined into a real-time mix by clicking on an icón or track-based grid. One such 
program is Live from Ableton Software (Figure 8.4). Live is an interactive loop- 
based program that's capable of recalculating the time, pitch and tempo struc- 
ture of a sound file or easily defined segment, and then entering that loop into 
the session at a defined global tempo. Basically, this means that a segment of 
any length or tempo that's been pulled into the session grid will be recalculated 
to the master tempo and can be combined, mixed and processed in perfect sync 
with all other loops in the project session. 
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FIGURE 8.4 

Ableton Uve 
performance audio 
workstation: (a) 
Arrangement View; 
(b) Session View. 
(Courtesy of Ableton, 
www.ableton.com.) 
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In Live's Arrangement View, as in all traditional sequencing programs, every- 
thing happens along a fixed song timeline. The program's Session View breaks 
this limiting paradigm by allowing media files to be mapped onto a grid as 
buttons (cabed clips). Any clip can be played at any time and in any order in 
a random fashion that lends itself to interactive performance both in the studio 
and on-stage. Functionally, each vertical column, or track, can play only one 
clip at a time. The horizontal rows are cabed scenes. Any clip can be played in 
a loop fashion by clicking on its launch button. By clicking on a scene launch 
at the screen's right, every clip in a row will simultaneously play beyond the 
wide range of timing, edit, effects, MIDI and instrument Controls. 

Of course, most of the looping software packages are capable of incorporating 
MIDI into a project or live interactive performance. This can be done through 
the creation of a MIDI sequence or drum pattern track, allowing performance 
data to be sequenced in a traditional fashion, or by allowing MIDI pattern loops 
to be mixed in with other media. When a loop DAW or editor is used, external 
MIDI hardware and various instrument plug-ins can easily play a prominent 
role in the Creative process. 

Reason 

If there's a software package that has gripped the hearts and minds of electronic 
musicians in the 21st century, it would have to be Reason from the folks at 
Propellerheads (Figure 8.5). Reason defies specific classification in that it's an 
overall music production environment that has many facets. For example, it 
ineludes a MIDI sequencer (Figure 8.6, possibly the most widely used sequencer 
of them all), as web as a wide range of software instrument modules that can 
be played, mixed and combined in a comprehensive environment that can be 
controlled from any external keyboard or MIDI controller. Reason also ineludes 
a large number of signal processors that can be applied to any instrument or 
instrument group, under full and easily controlled automation. 

In essence, Reason is a combination of modeled representations of vintage 
analog synthesis gear, mixed with the latest of digital synthesis and sampling 
technology (Figures 8.7 and 8.8). Combine these with a modular approach to 
signal and effects processing, add a generous amount of internal and remóte 
mix and controller management (via an external MIDI controller), top this off 
with a quirky but powerful sequencer, and you have a software package that's 
powerful enough for top-flight production and convenient enough that you can 
build tracks using your laptop in your seat on a crowded plañe. I know that it 
sounds like I read this off of a sales brochure, but these are the basic faets of 
this program. When asked to explain Reason to others I'm often at a loss, 
because the basic structure is so open ended and flexible that the program can 
be approached in as many ways as there are people who produce on it. That's 
not to say that Reason doesn't have a signature sound—it usually does. It's an 
amazing tool that can be either used on its own or in combination with other 
production instruments and tools. 
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FIGURE 8.5 

Reason music 
production environ- 
ment. (Courtesy of 
Propellerhead 
Software, www. 
propellerheads.se.) 
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FIGURE 8.6 

Reason’s sequencing 
window. (Courtesy of 
Propellerhead 
Software, www. 
propellerheads.se.) 
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FIGURE 8.7 

Reason’s SubTractor 
polyphonic synth 
module. (Courtesy of 
Propellerhead 
Software, www. 
propellerheads.se.) 



FIGURE 8.8 

Reason’s NN-XT 
sampler module. 
(Courtesy of 
Propellerhead 
Software, www. 
propellerheads.se.) 


In short, Reason functions by: 

■ Allowing you to choose a virtual instrument (or combination of 
instruments) 

■ Calling up a programmed sample loop or an instrument performance 
patch 

■ Allowing these sounds and sequence patterns to be looped and edited in 
new and unique ways 

■ Letting you create controller tracks that can vary and process the track in 
new and innovative ways 
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■ Allowing a wide range of synchronized and controllable effects to be 
added to the track mix. 

Of course, once you've finished the outline of a track, the obvious idea is to 
create new instrument tracks that can be combined in a traditional multitrack 
building-block approach, until a song begins to form. Since the loops and 
instruments are built from preprogrammed sounds and patches that reside on 
the hard disk, the actual Reason file (.rns) is often quite small, making this a 
perfect production media for publishing your songs on the Web or for collabo- 
rating with others around the world. Additional sounds, loops and patches are 
widely available for sale or for free as "refills" that can be added to your collec- 
tion to greatly expand the software's palette. 

I definitely suggest that you download the demo and take it for a spin. You'll 
also want to check out the "New to Reason" video clips. Due to its open-ended 
nature, this program might take a while to get used to, but the journey just 
might open you up to a whole new world of production possibilities. 

ReWire 

In Chapter 7, we learned that ReWire and ReWire2 are special protocols that 
were developed by Propellerhead Software and Steinberg to allow audio to be 
streamed between two simultaneously running Computer applications. Unlike 
a plug-in, where a task-specific application is inserted into a compatible host 
program, ReWire allows the audio and timing elements of a supporting program 
to be seamlessly integrated into another host program that supports ReWire. 
For example, both Cubase/Nuendo and Pro Tools are examples of DAWs that 
support ReWire, while Live and Reason are looping production tools that 
support the application. Because Reason doesn't support the recording of live 
audio tracks, it could be ReWired into either host DAW, allowing the audio to 
be routed through the DAW's virtual mixer or I/O, while the timing elements 
could be synchronously controlled by the host DAW. Likewise, Reason could be 
plugged into Live, allowing for greatly expanded instrument and production 
options in a studio or on-stage environment. 

If you feel up to the task, download a few program demos, consult the support¬ 
ing program manuals, and try it out for yourself. The most important rule to 
remember when using ReWire is that the host program should always be opened 
first, then the client. When shutting down, the client program should always be 
closed first. 

On a final ReWire note—the virtual instruments within Reason offer up a pow- 
erful set of instruments that can be ReWired into the MIDI and performance 
paths of a DAW. In this way, one or more virtual instruments could be played 
into the DAW from your controller, allowing you to add more instruments and 
tonalities to your production. Once you've finished tracking and the MIDI data 
has been saved to their respective tracks, the ReWired instruments could be 
exported (bounced) to audio tracks within the DAW. 
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GROOVE AND LOOP HARDWARE 

The software world doesn't actually hold the total patent on looping tools and 
toys; there are also a number of hardware groove keyboards and module boxes 
on the market. These Systems, which range widely in sounds, functionality and 
price, can offer up a vast range of unique sounds that can be quite useful for 
laying a foundation under your production. 

In the past, getting a hardware grove tool to sync into a session could be time 
consuming, frustrating and problematic, requiring time and tons of manual 
reading; however, with the advent of powerful time- and pitch-shift processing 
within most DAWs, the sounds from these hardware devices can be pulled into 
a session without too much trouble. For example, a single groove loop (or 
múltiple loops) could be recorded into a DAW (at a bpm that's cióse to the 
session's tempo), edited and then imported into the session, at which time 
the loop could be easily stretched into time sync, allowing it to be looped to 
your heart's content. Just remember, necessity is the mother of invention— 
patience and creativity are probably your most important tools in the looping 
process. 

Groove and loop plug-ins 

Of course, it's a sure bet that for every hardware looping tool, there are far 
more software plug-in groove tools and toys (Figures 8.9 and 8.10) that can 
be inserted into your DAW. These amazing software wonders often make life 
easier by: 

■ Automatically following the session tempo 

■ Allowing I/O routing to plug into the DAW's mixer 

■ Making use of the DAW's automation and external controller capabilities 



FIGURE 8.9 

Stylus RMX real-time 
groove module. 
(Courtesy of 
Spectrasonics, www. 
spectrasonics.net.) 
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FIGURE 8.10 

Beatscape groove 
module. (Courtesy of 
Cakewalk, Inc., 
www.cakewalk.com.) 
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FIGURE 8.11 

Edit screen for 
Rapture instrument 
plug-in. (Courtesy of 
Cakewalk, Inc., 
www.cakewalk.com.) 
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■ Allowing individual or combined groove loops to be imported into a 
session. 

These software instruments come in a wide range of sounds and applications 
that can often be edited and effected using an on-screen user interface. This 
interface can often be remotely controlled from an external MIDI controller, as 
shown in Figure 8.11. 
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Drum and drum loop plug-ins 

Virtual software drum and groove machines are also part of the present-day 
landscape and can be used in a stand-alone, plugged-in or rewired production 
environment. 

These plug-ins (Figures 8.12 and 8.13) are cap able of injecting a wide range 
of groove and sonic spice options into a digital audio and/or MIDI project at 
the session's current tempo, allowing accompaniment patterns that can range 
from being simple and nonvarying over time, to individually complex instru- 
ment parís that can be meticulously programmed into a session or performed 
on the fly. In addition, most of these tools inelude múltiple signal paths that 
let you route individual or groups of voices to a specific mixer input or DAW 
input. This makes it possible for isolated voices to be individually mixed, 
panned or processed (using equalization, effeets, etc.) into a session in new and 
interesting ways. 

Pulling loops into a DAW session 

When dealing with loops in modern-day production, one concept that needs 
to be discussed is that of importing loops into a DAW session. As we've seen, 
it's certainly possible to use the ReWire application to run a supporting client 
program in conjunction with the host program or even to use an instrument or 
groove-based plug-in within a compatible session. However, it should be noted 
that often (but not always) some sort of setup might be required in order to 
properly configure the application. Or at worst, these applications might use 
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FIGURE 8.12 

Steinberg’s Groove 
Agent 3 drum 
plug-in. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 






































DJ Software 


FIGURE 8.13 

BFD2 acoustic drum 
library module. 
(Courtesy of 
FXpansion, www. 
fxpansion.com.) 
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up valuable DSP resources that could otherwise be used for effects and signal 
Processing. 

One of the best options for freeing up these resources is to export the instru- 
ment or groove to a new audio track. This can be done in several ways (although 
with forethought, you might be able to come up with new ways of your own): 

■ The instrument track can be soloed and exported to an audio track in a 
contiguous fashion from the beginning "00:00:00:0o" of the session to 
the end of the instrumenté performance. 

■ In the case of a repetitive loop, a defined segment (often of a precise length 
of 4, 8, or more bars that occurs on the metric boundaries) can be selected 
for export as a sound file loop. Once exported to the file, it can be 
imported back into the session and looped into the session. 

■ In the case of an instrument that has múltiple parts or voices, each part 
can be soloed and exported to its own track—thus giving a greater degree 
of control during a mixdown session. 

DJ SOFTWARE 

In addition to music production software, there are a growing number of soft¬ 
ware players, loopers, groovers, effects and digital devices on the market for the 
21st century digital DJ. These hardware/software devices make it possible for 
digital grooves to be created from a laptop, controller, specially fitted turntable 
or digital turntable (jog/scratch CD player). Using such hardware-/software- 
based Systems, it's possible to sync, scratch and perform with vinyl within a 
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FIGURE 8.14 

Torq DJ performance 
production software 
with Conectiv USB 
DJ interface. 
(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 


synchronized digital environment with an unprecedented amount of pre- 
programmable or live performance interactivity that can be used on the floor, 
on-stage or in the studio (Figure 8.14). 

OBTAINING LOOP FILES FROM 
THE GREAT DIGITAL WELLSPRING 

In this day and age, there's absolutely no shortage of preprogrammed loops that 
will work with a number of DAWs and groove editors. Some sound files will 
need to be manually edited or programmed to work with your system. Others 
(such as Sony's ACID format) are already preprogrammed and can be imported 
into a number of loop-based production editors. Either way, these files can be 
easily obtained from any number of sources, such as: 

■ Those that are included for free with newly purchased software 

■ The Web (both free and for purchase) 

■ Commercial CDs 

■ Free files within CDs that are loaded onto demo CDs and magazine CD 
content 

■ Rolling your own (creating your own loops can add a satisfying and per¬ 
sonal touch). 

It's important to note that at any point during the creation of a composition, 
audio and MIDI tracks (such as vocals or played instruments) can often be easily 
recorded into a loop session in order to give the performance a fluid and more 
dynamic feel. It's even possible to record a live instrument into a session with 
a defined tempo and then edit these tracks into defined loops that can be 
dropped into the current and future sessions to add a live touch. 




Obtaining Loop Files from the Great Digital Wellspring 


As with most music technologies, the field of looping and laying tracks into a 
groove continúes to advance and evolve at an alarming rate. It's almost a sure 
bet that your current system will support looping, one way or another. Take 
time to read the manuals, gather up some loops that fit your style and particu¬ 
lar interests and start working them into a session. It might take you some time 
to master the art of looping—and then again you might be a natural Zen master. 
Either way, the journey is educational and tons of fun! 


CHAPTER 9 


MIDI and Electronic 
Music Technology 


Today, professional and nonprofessional musicians are using the language of 
the Musical Instrument Digital Interface (MIDI) to perform an expanding range 
of music and automation tasks within audio production, audio for video, 
film post, stage production, etc. This industry-wide acceptance can, in large 
part, be attributed to the cost effectiveness, power and general speed of MIDI 
production. Once a MIDI instrument or device comes into the production 
picture, there's often less need (if any at all) to hire outside musicians for a 
project. This alluring factor allows a musician to compose, edit and arrange a 
piece in an electronic music environment that's extremely flexible. By this, Y m 
not saying that MIDI replaces (or should replace) the need for acoustic instru- 
ments, microphones and the traditional performance setting. In fact, it's a 
powerful production tool that assists countless musicians in creating music and 
audio productions in ways that are innovative and highly personal. In short, 
MIDI is all about control, repeatability, flexibility, cost-effective production 
power and fun. 

The affordable potential for future expansión and increased control over an 
integrated production system has spawned the growth of an industry that allows 
an individual to cost effectively realize a full-scale sound production, not only 
in his or her own lifetime ... but in a relatively short time. For example, much 
of modern-day film music owes its very existence to MIDI. Befo re this technol- 
ogy, composers were forced to create without the benefits of hearing their 
composition or by creating a reduction score that could only be played on a 
piano or small ensemble (due to the cost and politics of hiring an orchestra). 
With the help of MIDI, composers can hear their work in real time, make any 
necessary changes, print out the scores ... and take a full orchestra into the 
studio to record the final score versión. At the other end of the spectrum, MIDI 
can be an extremely personal tool that lets us perform, edit and layer synthe- 
sized and/or sampled instruments to create a song that helps us to express 
ourselves to the masses ... all within the comfort of the home or personal 
project studio. The moral of this story is that today's music industry would look 
and sound very different if it weren't for this powerful, four-letter production 
word. 



MIDI Production Environments 


MIDI PRODUCTION ENVIRONMENTS 

One of the more powerful aspects of MIDI production is that a system can be 
designed to handle a wide range of tasks with a degree of flexibility and ease 
that best suits an artist's main instrument, playing style and even the user's 
personal working habits. By opening up almost any industry-related magazine, 
you'll easily see that a vast number of electronic musical instruments, effects 
devices, Computer systems and other MIDI-related devices are currently avail- 
able on the new and used electronic music market. MIDI production systems 
exist in all kinds of shapes and sizes and can be incorporated to match a wide 
range of production and budget needs. For example, working and aspiring 
musicians commonly install digital audio and MIDI systems in their homes 
(Figure 9.1). These production environments can range from ones that take up 
a córner of an artist's bedroom to larger systems that are integrated into a 
dedicated project studio. Systems such as these can be specially designed to 
handle a multitude of applications and have the important advantage of letting 
artists produce their music in a comfortable environment ... whenever the Cre¬ 
ative mood hits. Newer, laptop-based systems allow us to make music "wher- 
ever and whenever" from the comfort of your trusty backpack. Such production 
luxuries, that would have literally cost an artist a fortune in the not-too-distant 
past, are now within the reach of almost every musician. 

Once MIDI has been mastered, its repeatability and edit control offer produc¬ 
tion challenges and possibilities that can stretch beyond the capabilities and 
cost effectiveness of the traditional multitrack recording environment. When 
combined with digital audio workstations (DAWs) and modern-day recording 
technology, much of the music production process can be preplanned and 
rehearsed before you step into the studio. In fact, it's not uncommon for 


FIGURE 9.1 

GettirT it all going in 
the bedroom studio. 
(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 
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FIGURE 9.2 

Art ’n Audio Studio, 
Thailand. (Courtesy 
of Solid State Logic, 
www.solid-state- 
logic.com.) 


recorded tracks to be laid down before they ever see the hallowed halls of the 
professional studio (if they see them at all). In business jargón, this luxury has 
reduced the number of billable hours to a cost-effective mínimum. 

Since its inception, electronic music has been an indispensable tool for the 
scoring and audio postproduction of televisión and radio commercials, indus¬ 
trial videos and full-feature motion picture sound tracks (Figure 9.2). For pro- 
ductions that are on a budget, an entire score can be created in the artist's project 
studio using MIDI, hard-disk tracks and digital recorders—all at a mere fraction 
of what it might otherwise cost to hire the musicians and rent a studio. 

Electronic music production and MIDI are also very much at home on the stage. 
In addition to using synths, samplers, DAWs and drum machines on the stage, 
most or all of a MIDI instrument and effects device parameters can be controlled 
from a presequenced or real-time controller source. This means that all the 
necessary settings for the next song (or section of a song) can be automatically 
called up before being played. Once under way, various instrument patch and 
controller parameters can also be changed during a live performance from a 
stomp box controller, keyboard or other hardware controller. 

One of the media types that can be included in the notion of multimedia is 
definitely MIDI. With the advent of General MIDI (GM, a standardized spec 
that allows any soundcard or GM-compatible device to play back a score using 
the originally intended sounds and program settings), it's possible (and 
common) for MIDI scores to be integrated into multimedia games and websites. 

With the integration of the General MIDI standard into various media devices, 
one of the fastest growing MIDI applications, surprisingly, is probably comfort- 
ably resting in your pocket or purse right now—the ring tone on your cell phone 
(Figure 9.3). The ability to use MIDI (and often digital sound files) to let you 
know who is calling has spawned an industry that allows your cell to be per- 


















What is MIDI? 


FIGURE 9.3 

One ringy-dingy ... 
MIDI helps us to 
reach out and touch 
someone through 
ring tones. 



sonalized and super fun. One of my favorite ring tone stories happened on 
Hollywood Boulevard in L.A. This tall, lanky man was sitting at a café when his 
cell phone started blaring out the "If I Only Had a Brain" sequence from The 
Wizard of Oz. It wouldn't have been nearly as funny if the guy didn't look a lot 
like the scarecrow character. Of course, everyone laughed. 

WHAT IS MIDI? 

Simply stated, the Musical Instrument Digital Interface (MIDI) is a digital Com¬ 
munications language and compatible specification that allows múltiple hard¬ 
ware and software electronic instruments, performance controllers, computers 
and other related devices to communicate with each other over a connected 
network (Figure 9.4). MIDI is used to transíate performance- or control-related 
events (such as playing a keyboard, selecting a patch number, varying a modu- 
lation wheel, triggering a staged visual effect, etc.) into equivalent digital mes- 
sages and then transmit these messages to other MIDI devices where they can 
be used to control sound generators and other performance/control parameters. 
The beauty of MIDI is that its data can be recorded into a hardware device or 
software program (known as a sequencer), where it can be edited and transmit- 
ted to electronic instruments or other devices to create music or control any 
number of parameters in a postperformance setting. 

In addition to composing and performing a song, musicians can also act as 
techno-conductors, having complete control over a wide palette of sounds, their 
timbre (sound and tonal quality), overall blend (level, panning) and other real- 
time Controls. MIDI can also be used to vary the performance and control 
parameters of electronic instruments, recording devices, control devices and 
signal processors in the studio, on the road or on the stage. 
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FIGURE 9.4 

Example of a typical 
MIDI system with 
the MIDI network 
connections 
highlighted. 


The term interface refers to the actual data Communications link and software/ 
hardware Systems in a connected MIDI network. Through the use of MIDI, it's 
possible for all of the electronic instruments and devices within a network to 
be addressed through the transmission of real-time performance and control- 
related MIDI data messages throughout a system to múltiple instruments and 
devices through one or more data lines (which can be chained from device to 
device). This is possible because a single data cable is capable of transmitting 
performance and control messages over 16 discrete channels. This simple fact 
allows electronic musicians to record, overdub, mix and play back their perfor¬ 
mances in a working environment that loosely resembles the multitrack record- 
ing process. Once mastered, MIDI surpasses this analogy by allowing a 
composition to be edited, controlled, altered and called up with complete 
automation and repeatability—all of this providing production challenges and 
possibilities that are web beyond the capabilities of the traditional tape-based 
multitrack recording process. 

What MIDI isn’t 

For starters, let's dispel one of MIDI's greatest myths: MIDI DOESN'T commu- 
nicate audio, ñor can it create sounds! It is strictly a digital language that 
instructs a device or program to create, play back or alter the parameters of 
sound or control function. It is a data protocol that communicates on/off 




































































System Interconnections 


FIGURE 9.5 

Example of a typical 
MIDI system with the 
audio connections 
highlighted. 


microphones 


Computer 




instruments 


mil 


. 


* • «• 

>4P» 


audio mixer 



€€€€ € 


audio interface with MIDI l/O 


MIDI controller 


r 



MIDI module 

€€ 


C oooc 


MIDI module 

€€ 


© cocc 


eeeeeeoo 

00990900 

eeoeeeee 
. oeeeeeee 
r oeeeeeee 
t oeeeeeee 


MIDI module 

€€ 


C cocc 


♦ 





powered speakers 


triggering and a wide range of parameters to instruct an instrument or device 
to generate, reproduce or control audio or production-related functions. Because 
of these differences, the MIDI data path and the audio routing paths are entirely 
sepárate from each another (Figure 9.5). Even if they digitally share the same 
transmission cable (such as through FireWire or USB), the actual data paths and 
formats are completely sepárate. 

In short, MIDI communicates information that instructs an instrument to play 
or a device to carry out a function. It can be likened to the dots on a player- 
piano roll; when we put the paper roll up to our ears, we hear nothing, but 
when the cut-out dots pass over the sensors on a player piano, the instrument 
itself begins to make beautiful music. It's exactly the same with MIDI. A MIDI 
file or datastream is simply a set of instructions that pass down a wire in a serial 
fashion, but when an electronic instrument interprets the data, we begin to hear 
sound. 

SYSTEM INTERCONNECTIONS 

As a data transmission médium, MIDI is relatively unique in the world of sound 
production in that it's able to pack 16 discrete channels of performance, control¬ 
ler and timing information and then transmit it in one direction, using data 
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densities that are economically small and easy to manage. In this way, it's pos- 
sible for MIDI messages to be communicated from a specific source (such as a 
keyboard or MIDI sequencer) to any number of devices within a connected 
network over a single MIDI data chain. In addition, MIDI is flexible enough 
that múltiple MIDI data lines can be used to interconnect devices in a wide 
range of possible system configurations; for example, múltiple MIDI lines can 
be used to transmit data to instruments and devices over 32, 48, 128 or more 
discrete MIDI channels! 


The MIDI cable 

A MIDI cable (Figure 9.6) consists of a shielded, twisted pair of conductor 
wires that has a male 5-pin DIN plug located at each of its ends. The MIDI 




FIGURE 9.6 

The MIDI cable: (a) 
wiring diagram; (b) 
the cable itself. 










Midi Phantom Power 


specification currently uses only three of the five pins, with pins 4 and 5 being 
used as conductors for MIDI data; pin 2 is used to connect the cable's shield to 
equipment ground. Pins 1 and 3 are currently not in use, although the next 
section describes an ingenious system for powering devices through these pins 
that's known as MIDI phantom power. The cables use twisted cable and metal 
shield groundings to reduce outside interference, such as radio-frequency inter- 
ference (RFI) or electrostatic interference, both of which can serve to distort or 
disrupt the transmission of MIDI messages. 


MIDI Pin Description 


Pin 1 ¡s not used in most cases; however, ¡t can be 
used to provide the V- (ground retum) of a MIDI 
phantom power supply. 

Pin 2 is connected to the shield or ground cable, which 
protects the signal from radio and electromagnetic 
interference. 


■ Pin 3 is not used in most cases; however, ¡t can be 
used to provide the V+ (+9 to +15V) of a MIDI phantom 
power supply. 

■ Pin 4 is a MIDI data line. 

■ Pin 5 is a MIDI data line. 


MIDI cables come prefabricated in lengths of 2, 6, 10, 20 and 50 feet and can 
commonly be obtained from music stores that specialize in MIDI equipment. 
To reduce signal degradations and external interference that tends to occur over 
extended cable runs, 50 feet is the máximum length specified by the MIDI spec. 
(As an insider tip, I found Radio Shack to be a great source for picking up shorter 
MIDI cables at a fraction of what you'd sometimes spend at a music store.) 

MIDI PHANTOM POWER 

In December 1989, Craig Anderton (musician, audio guru and editor of Elec¬ 
tronic Musician magazine) submitted an article to EM proposing an idea that 
provides a standardized 12-V DC power supply to instruments and MIDI devices 
directly through pins 1 and 3 of a basic MIDI cable. Although pins 1 and 3 are 
technically reserved for possible changes in future MIDI applications (which 
never really carne about), over the years several forward-thinking manufacturers 
(and project enthusiasts) have begun to implement MIDI phantom power 
(Figure 9.7) directly into their studio and on-stage Systems. 

For the more adventurous types who aren't afraid to get their soldering irons 
hot, Ive included a basic schematic for getting rid of your wall wart and power¬ 
ing a system through the MIDI cable itself. It should be noted that not all MIDI 
cables connect all five pins. Some manufacturers use only three wires to save 
on manufacturing costs, thus it's best to make sure what you've got before you 
delve into your own personal project. Of course, neither the publisher ñor I 
extend any warranty to anyone regarding damage that might occur as a result 
of this modification. Think it out carefully, have a steady hand and don't blow 
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anything up! (On a personal note, I've retrofitted my own MIDI guitar rig, so 
the wall wart's off-stage, eliminating the need for a nearby power plug ... which 
I'd probably kick into the next State in the middle of a gig.) 
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FIGURE 9.7 

MIDI phantom power 
wiring diagram. 


FIGURE 9.8 

MidAir 25 25-key 
wireless USB MIDI 
controller and 
receiver. (Courtesy of 
M-Audio, a división 
of Avid Technology, 
Inc., www.m-audio. 
com.) 


WIRELESS MIDI 

Several companies have begun to manufacture wireless MIDI transmitters 
(Figure 9.8) that allow a battery-operated MIDI guitar, wind controller, etc., to 





MIDI Jacks 


be footloose and fancy free on-stage and in the studio. Working at distances of 
up to 500 feet, these battery-powered transmitter/receiver systems introduce 
very low delay latencies and can be switched over a number of radio channel 
frequencies. 

MIDI JACKS 

MIDI is distributed from device to device using three types of MIDI jacks: MIDI 
In, MIDI Out and MIDI Thru (Figure 9.9). These three connectors use 5-pin 
DIN jacks as a way to connect MIDI instruments, devices and computers into 
a music or production network system. As a side note, it's nice to know that 
these ports (as strictly defined by the MIDI 1.0 spec) are optically isolated to 
eliminate possible ground loops that might occur when connecting numerous 
devices together. 

■ MIDI In jack: The MIDI In jack receives messages from an external source 
and communicates this performance, control and timing data to the 
device's internal microprocessor, allowing an instrument to be played or 
a device to be controlled. More than one MIDI In jack can be designed 
into a system to provide for MIDI merging functions or for devices that 
can support more than 16 channels (such as a MIDI interface). Other 
devices (such as a controller) might not have a MIDI In jack at all. 

■ MIDI Out jack: The MIDI Out jack is used to transmit MIDI performance, 
control messages or SysEx from one device to another MIDI instrument 
or device. More than one MIDI Out jack can be designed into a system, 
giving it the advantage of controlling and distributing data over múltiple 
MIDI paths using more than just 16 channels (i.e., 16 channels x N MIDI 
port paths). 


FIGURE 9.9 

MIDI In, Out and Thru 
ports, showing the 
device’s signal path 
routing. 


Device Microprocessor 
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■ MIDI Thrujack: The MIDI Thru jack retransmits an exact 
copy of the data that's being received at the MIDI In jack. 
This process is important, because it allows data to pass 
directly through an instrument or device to the next 
device in the MIDI chain. Keep in mind that this jack is 
used to relay an exact copy of the MIDI In datastream, 
which isn't merged with data being transmitted from the 
MIDI Out jack. 

MIDI ECHO 


mieroprocessor 

□ □ □ d o □ □ □ □ □ o 

echo off 


echo on 


□ □ 


□□□□□□□c □ 



OUT/ECHO 


Certain MIDI devices may not inelude a MIDI Thru jack at all. 

Some of these devices, however, may give the option of switch- 
ing the MIDI Out between being an actual MIDI Out jack and 
a MIDI Echo jack (Figure 9.10). As with the MIDI Thru jack, a 
MIDI Echo option can be used to retransmit an exact copy of any information 
that's received at the MIDI In port and route this data to the MIDI Out/Echo 
jack. Unlike a dedicated MIDI Out jack, the MIDI Echo function can often be 
selected to merge incoming data with performance data that's being generated 
by the device itself. In this way, more than one controller can be placed in a 
MIDI system at one time. Note that, although performance and timing data can 
be echoed to a MIDI Out/Echo jack, not all devices are capable of echoing SysEx 
data. 



FIGURE 9.10 

MIDI echo 
configuration. 


TYPICAL CONFIGURATIONS 

Although electronic studio production equipment and setups are rarely alike 
(or even similar), there are a number of general rules that make it easy for MIDI 
devices to be connected to a functional network. These common configurations 
allow MIDI data to be distributed in the most efficient and understandable 
manner possible. 


As a primary rule, there are only two valid ways to 
connect one MIDI device to another within a MIDI chain 
(Figure 9.11): 

■ The MIDI Out jack of a source device (controller or 
sequencer/computer) must be connected to the MIDI 
In of a second device in the chain. 


■ The MIDI Thru jack of the second device must be 
connected to the MIDI In jack of the third device in the 
chain ... following this same Thru-to-ln convention until 
the end of the chain is reached. 


The daisy chain 

One of the simplest and most common ways to distribute data throughout a 
MIDI system is the daisy chain. This method relays MIDI data from a source 
device (controller or sequencer/computer) to the MIDI In jack of the next device 














Typical Configurations 



Device #1 Device #2 Device #3 

FIGURE 9.11 

The two valid means of connecting one MIDI device to another. 
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FIGURE 9.12 

Example of a 
connected MIDI 
system using a daisy 
Chain: (a) typical 
daisy Chain hookup; 
(b) example of how 
a Computer can 
be connected into 
a daisy Chain. 
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in the chain (which receives and acts on this data). This next device then relays 
an exact copy of the incoming data out to its MIDI Thru jack, which is then 
relayed to the next device in the chain, and so on through the successive devices. 
In this way, up to 16 channels of MIDI data can be chained from one device to 
the next within a connected data network—and it's precisely this concept of 
transmitting múltiple channels through a single MIDI line that makes the whole 
concept work! Let's try to understand this system better by looking at a few 
examples. 

Figure 9.12a shows a simple (and common) example of a MIDI daisy chain 
whereby data flows from a controller (MIDI Out jack of the source device) to a 
synth module (MIDI In jack of the second device in the chain). An exact copy 
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of the data that flows into the second device is then relayed to its MIDI Thru 
jack to another synth (MIDI In jack of the third device in the chain). If our 
controller is set to transmit on MIDI channel 3, the second synth in the chain 
(which is set to channel 2) will ignore the messages and not play, while the 
third synth (which is set to channel 3) will be playing its heart out. The moral 
of this story is that, although there's only one connected data line, a wide 
range of instruments and channel voices can be played in a surprisingly large 
number of combinations—all by using individual channel assignments along 
a daisy chain. 

Another example (Figure 9.12b) shows how a Computer can easily be designated 
as the master source within a daisy chain so a sequencing program can be used 
to control the entire playback and channel routing functions of a daisy-chained 
system. In this situation, the MIDI data flows from a master controller/synth to 
the MIDI In jack of a computer's MIDI interface (where the data can be played 
into, processed and rerouted through a MIDI sequencer). The MIDI Out of the 
interface is then routed back to the MIDI In jack of the master controller/synth 
(which receives and acts on this data). The controller then relays an exact copy 
of this incoming data out to its MIDI Thru jack (which is then relayed to the 
next device in the chain) and so on, until the end of the chain is reached. When 
we stop and think about it, we can see that the controller is essentially used as 
a "performance tool" for entering data into the MIDI sequencer, which is then 
used to communicate this data out to the various instru¬ 
ments throughout the connected MIDI chain. 

The multiport network 

Another common approach to 
routing MIDI throughout a produc- 
tion system involves distributing 
MIDI data through the múltiple 2, 4 
and 8 In/Out ports that are available 
on the newer multiport MIDI interfaces 
or through the use of múltiple MIDI USB 
interface devices. 

In larger, more complex MIDI systems, a multiport MIDI network (Figure 9.13) 
offers several advantages over a single daisy chain path. One of the most impor- 
tant is its ability to address devices within a complex setup that requires 
more than 16 MIDI channels. For example, a 2 x 2 MIDI interface that has two 
independent In/Out paths is cap able of simultaneously addressing up to 32 
channels (i.e., port A 1-16 and port B 1-16), whereas an 8 x 8 port is capable 
of addressing up to 128 individual MIDI channels. 

This type of network of independent MIDI chains has a number of advantages. 
As an example, port A might be dedicated to three instruments that are set to 
respond to MIDI channels 1 to 6, 7 and finally channel 11, whereas port B might 
be transmitting data to two instruments that are responding to channels 1-4 


NOTE 

Although the distinction isn’t overly 
important, you might want to keep in mind that 
a MIDI “port” is a virtual data path that’s 
processed through a Computer, luhereas a 
MIDI "jack” is the physical 
connection on a device itself. 
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USB (4x4 port) 


USB (2x2 port) -► 
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FIGURE 9.13 

Example of a multiport network using two MIDI interfaces. 


and 5-10 and port C might be communicating SysEx MIDI data to and from a 
MIDI remóte controller for a digital audio workstation (DAW). In this modern 
age of audio interfaces, multiport MIDI interfaces and controller devices that 
are each fitted with MIDI ports, it's a simple matter for a Computer to route and 
synchronously communicate MIDI data throughout the studio in any number 
of ingenious and cost-effective ways. 

EXPLORING THE SPEC 

MIDI is a specified data format that must be strictly adhered to by those who 
design and manufacture MIDI-equipped instruments and devices. Because the 
format is standardized, you don't have to worry about whether the MIDI output 
of one device will be understood by the MIDI in port of a device that's made 
by another manufacturer. As long as the data ports say and/or communicate 
MIDI, you can be assured that the data (at least the basic performance func- 
tions) will be transmitted and understood by all devices within the connected 
system. In this way, the user need only consider the day-to-day dealings that go 
hand-in-hand with using electronic instruments, without having to be con¬ 
cerned with data compatibility between devices. 
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The MIDI message 

MIDI digitally communicates musical performance data between devices as a 
string of MIDI messages. These messages are traditionally transmitted through 
a standard MIDI line in a serial fashion at a speed of 31,250 bits/sec. Within a 
serial data transmission line, data travels in a single-file fashion through a single 
conductor cable (Figure 9.14a); a parallel data connection, on the other hand, 
is able to simultaneously transmit digital bits in a synchronous fashion over a 
number of wires (Figure 9.14b). 

When using a standard MIDI cable, it's important to remember that data can 
only travel in one direction from a single source to a destination (Figure 9.15a). 
To make two-way communication possible, a second MIDI data line must be 
used to communicate data back to the device, either directly or through the 
MIDI chain (Figure 9.15b). 

MIDI messages are made up of groups of 8-bit words (known as bytes), which 
are transmitted in a serial fashion to convey a series of instructions to one or 
all MIDI devices within a system. 
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(a) 

FIGURE 9.14 

(b) 


Serial versus parallel data transmission: (a) Serial data must be transmitted in a single-file fashion over a serial data 
line. (b) Múltiple bits of data can be synchronously transmitted over a number of parallel lines. 
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FIGURE 9.15 

MIDI data can only 
travel in one 
direction through a 
single MIDI cable: 

(a) data transmission 
from a single source 
to a destination; 

(b) two-way data 
communication using 
two cables. 
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Only two types of bytes are defined by the MIDI 
specification: the status byte and the data byte. 

■ A status byte ¡s used to identify what type of MIDI 
function ¡s to be performed by a device or program. It 
¡s also used to encode channel data (allowing the 


instruction to be received by a device that’s set to 
respond to the selected channel). 

■ A data byte ¡s used to associate a valué to the event 
that’s given by the accompanying status byte. 


FIGURE 9.16 

Up to 16 channels 
can be communi- 
cated through a 
single MIDI cable. 


Although a byte is made up of 8 bits, the most significant bit (MSB; the leftmost 
binary bit within a digital word) is used solely to identify the byte type. The 
MSB of a status byte is always 1, while the MSB of a data byte is always 0. For 
example, a 3-byte MIDI Note-On message (which is used to signal the beginning 
of a MIDI note) might read in binary form as a 3-byte Note-On message of 
(10010100)(01000000) (01011001). This particular example transmits instruc- 
tions that would be read as: "Transmitting a Note-On message over MIDI 
channel #5, using keynote #64, with an attack velocity fvolume level of a note] 
of 89." 


MIDI channels 


1. mondo pad 

2. taiko drums 

3. piano 

4. solo synth 

5. guitar 

6. strings lo 

7. strings hi 

8. chorus vocals 

9. sandy claws 

10. perc 

11. cow bell 

12. horns 

13. big bass 

14. lo bass 

15. celeste 

16. B3 


Just as a public speaker might single out and 
communicate a message to one individual in a 
crowd, MIDI messages can be directed to com¬ 
municate information to a specific device or 
range of devices within a MIDI system. This is 
done by embedding a channel-related nibble 
(4 bits) within the status/channel number byte. 
This process makes it possible for up to 16 
channels of performance or control informa¬ 
tion to be communicated to a specific device, 
or a sound generator through a single MIDI 
data cable (Figure 9.16). 




Since this nibble is 4 bits wide, up to 16 discrete MIDI channels can be transmitted through a single MIDI cable or 
designated port. 


0000 = CH#1 
0001 = CH#2 
0010 = CH#3 
0011 = CH#4 


0100 = CH#5 
0101 = CH#6 
0110 = CH#7 
0111 = CH#8 


1000 = CH#9 

1001 = CH#10 

1010 = CH#11 

1011 = CH#12 


1100 = CH#13 

1101 = CH#14 

1110 = CH#15 

1111 = CH#16 
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Master Controller (Ch #10 - percussion) 


Synth Module (Ch #3) 
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Sampler Module (Ch #5> 


FIGURE 9.17 

MIDI setup showing a 
set of MIDI channel 
assignments. 


Whenever a MIDI device, sound generator within a device or program function 
is instructed to respond to a specific channel number, it will only react to mes- 
sages that are transmitted on that channel (i.e., it ignores channel messages that 
are transmitted on any other channel). For example, let's assume that we're 
going to create a short song using a synthesizer that has a built-in sequencer (a 
device or program that's capable of recording, editing and playing back MIDI 
data) and two other "synths" (Figure 9.17): 

1. We could easily start off by recording a simple drum pattern track into 
the master synth on channel #10 (numerous synths are preassigned to 
output drum/percussion sounds on this channel). 

2. Once recorded, the sequence will thereafter transmit the notes and data 
over channel 10, allowing the synth's percussion section to be played. 

3. Next, we could set a synth module to channel #3 and instruct the master 
synth to transmit on the same channel (since the synth module is set to 
respond to data on channel 3, its generators will sound whenever the 
master keyboard is played). We can now begin recording a melody line 
into the sequencer's next track. 

4. Playing back the sequence will then transmit data to both the master 
synth (perc section #10) and the module (melody line #3) over their 
respective channels. At this point, our song is beginning to take shape. 

5. Now we can set the other synth (or other instrument type) to respond to 
channel #5 and instruct the master synth to transmit on the same channel, 
allowing us to further embellish the song. 

6. Now that the song's beginning to take shape, the sequencer can play the 
musical parts to the synths on their respective MIDI channels—all in an 
environment that gives us complete control over voicing, volume, panning 
and a wide range of edit functions over each instrument. In short, we've 
created a true multichannel production environment. 

It goes without saying that the above example is just one of the infinite setup 
and channel possibilities that can be encountered in a production environment. 
It's often true, however, that even the most complex MIDI and production 
rooms will have a system—a basic channel and overall layout that makes the 
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day-to-day operation of making music easier. This layout and the basic decisions 
that you might make in your own room are, of course, up to you. Streamlining 
a system to work both efficiently and easily will come with time, experience and 
practice. 


MIDI modes 

Electronic instruments often vary in the number of sounds and notes that can 
be simultaneously produced by their internal sound-generating circuitry. For 
example, certain instruments can only produce one note at a single time (known 
as a monophonic instrument), while others can generate 16, 32 and even 64 
notes at once (these are known as polyphonic instruments). The latter type 
can easily play chords or more than one musical line on a single instrument at 
a time. 


FIGURE 9.18 

Multitimbral 
instruments are 
virtual bands-in-a- 
box that can 
simultaneously 
generate múltiple 
patches, each of 
which can be 
assigned to its own 
MIDI channel. 


In addition, some instruments are only capable of producing a single generated 
sound patch (often referred to as a "voice") at any one time. Its generating 
circuitry could be polyphonic, allowing the player to lay down chords and bass 
or melody lines, but it can only produce these notes using a single, character- 
istic sound at any one time (e.g., an electric piano, a synth bass or a string 
patch). However, the vast majority of newer synths differ from this in that they're 
multitimbral in nature, meaning that they can generate numerous sound patches 
at any one time (e.g., an electric piano, a synth bass and a string patch, as can 
be seen in Figure 9.18). That's to say that it's common to run across electronic 
instruments that can simultaneously generate a number of voices, each offering 
its own control over a wide range of parameters. Best of all, it's also common 
for different sounds to be assigned to their own MIDI channels, allowing múl¬ 
tiple patches to be internally mixed within the device to a stereo output bus or 
independent outputs. 


The following list and figures explain the four modes that are supported by the 
MIDI spec: 

■ Mode 1 (Omni On/Poly): In this mode, an instrument will respond to data 
that's being received on any MIDI channel and then redirect this data to 
the instrumenté base channel. In essence, the device will play back every- 
thing that's presented at its input in a polyphonic fashion, regardless of 
the incoming channel designations. As you might guess, this mode is 

rarely used. 

■ Mode 2 (Omni On/Mono): As 
in Mode 1, an instrument will 
respond to all data that's being 
received at its input, without 
regard to channel designations; 
however, this device will only 
be able to play one note at a 
time. Mode 2 is used even more 
rarely than Mode 1, as the 
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device can't discrimínate channel designations and can only play one note 
at a time. 

■ Mode 3 (Omni Off/Poly): In this mode, an instrument will only respond 
to data that matches its assigned base channel in a polyphonic fashion. 
Data that's assigned to any other channel will be ignored. This mode is 
by far the most commonly used, as it allows the voices within a multit- 
imbral instrument to be individually controlled by messages that are 
being received on their assigned MIDI channels. For example, each of the 
16 channels in a MIDI line could be used to independently play each of 
the parts in a 16-voice, multitimbral synth. 

■ Mode 4 (Omni Off/Mono): As with Mode 3, an instrument will be able to 
respond to performance data that's transmitted over a single, dedicated 
channel; however, each voice will only be able to generate one MIDI note 
at a time. A practical example of this mode is often used in MIDI guitar 
Systems, where MIDI data is monophonically transmitted over six con- 
secutive channels (one channel/voice per string). 


Channel voice messages 

Channel Voice messages are used to transmit real-time performance data 
throughout a connected MIDI system. They're generated whenever a MIDI 
instrumenté controller is played, selected or varied by the performer. Examples 
of such control changes could be the playing of a keyboard, pressing of program 
selection buttons or movement of modulation or pitch wheels. Each Channel 
Voice message contains a MIDI channel number within its status byte, meaning 
that only devices that are assigned to the same channel number will respond 
to these commands. There are seven Channel Voice message types: Note-On, 
Note-Off, Polyphonic Key Pressure, Channel Pressure, Program Change, Pitch 
Bend Change and Control Change: 


Note-On messages (Figure 9.19): Indícate the beginning of a MIDI note. 
This message is generated each time a note is triggered on a keyboard, 
drum machine or other MIDI instrument (by pressing a key, striking a 
drum pad, etc.). A Note-On message consists of three bytes of informa- 
tion: a MIDI channel number, a MIDI pitch number, and an attack veloc- 
ity valué (messages that are used to transmit the individually played 
volume levels [0-127] of each note). 

Note-Off messages: Indícate the release 
(end) of a MIDI note. Each note played 
through a Note-On message is sustained 
until a corresponding Note-Off message 
is received. A Note-Off message doesn't 
cut off a sound; it merely stops playing 
it. If the patch being played has a release 
(or final decay) stage, it begins that 
stage upon receiving this message. It 
should be noted that many systems will 


Status/Ch # 
(0-15) 


Note # 
(0-127) 


FIGURE 9.19 

Byte structure of a 
MIDI Note-On 
message. 

Attack Velocity 
(0-127) 


(1001 CCCC) (0NNNNNNN) (0VW WVV) 
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actually use a Note-On message with a velocity 0 to denote a Note-Off 
message. 

■ Polyphonic Key Pressure messages (Figure 9.20): Transmitted by instruments 
that can respond to pressure changes applied to the individual keys of a 
keyboard. A Polyphonic Key Pressure message consists of three bytes of 
information: a MIDI channel number, a MIDI pitch number and a pres¬ 
sure valué. 

■ Channel Pressure (or Aftertouch) messages (Figure 9.21): Transmitted and 
received by instruments that respond to a single, overall pressure applied 
to the keys. In this way, additional pressure on the keys can be assigned 
to control such variables as pitch bend, modulation and panning. 

■ Program Change messages (Figures 9.22 and 9.23): Change the active voice 
(generated sound) or preset program number in a MIDI instrument or 
device. LJsing this message format, up to 128 presets (a user- or factory- 
defined number that activates a specific sound-generating patch or system 
setup) can be selected. A Program Change message consists of two bytes 
of information: a MIDI channel number (1-16) and a program ID number 
(0-127). 

■ Pitch Bend Change messages (Figures 9.24 and 9.25): Transmitted by an 
instrument whenever its pitch bend wheel is moved in either the positive 


FIGURE 9.20 

Byte structure of a 
MIDI Polyphonic Key 
Pressure message 
(generated when 
additional pressure is 
applied to each key 
that’s played). 


Status/Ch # Note U Pressure Valué 
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FIGURE 9.21 

Byte structure of a 
MIDI Channel 
Pressure message 
(simultaneously 
affect all notes that 
are transmitted over 
a MIDI channel). 


FIGURE 9.22 

Program Change 
messages can be 
used to change 
sound patches from 
a sequencer or from 
a remóte controller. 
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FIGURE 9.23 

Workstations and sequencer software systems will often allow patches to be recalled via Program Change messages. 
(Courtesy of Steinberg Media Technologies GmbH, a división of Yamaha Corporation, www.steinberg.net, and Digide- 
sign, a división of Avid Technology, www.digidesign.com.) 
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FIGURE 9.24 

Byte structure of a 
Pitch Bend Change 
message. 



Minimum Valué = 0 Mid Valué = 64 Máximum Valué = 127 


FIGURE 9.25 

Pitch bend wheel 
data valué ranges. 


(raise pitch) or negative (lower pitch) direction from its central (no pitch 
bend) position. 

■ Control Change messages (Figures 9.26 and 9.27): Transmit information 
that relates to real-time control over a MIDI instrument's performance 
parameters (such as modulation, main volume, balance and panning). 
Three types of real-time Controls can be communicated through 
control change messages: continuous controllers, which communicate a 
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FIGURE 9.26 

M-audio controller. 
(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 


MIDI controllers 



pitch bend & modulation wheels 


FIGURE 9.27 

Control Change 
message: (a) byte 
structure; (b) control 
messages can be 
varied in real time or 
under automation 
using a number of 
input methods. 
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continuous range of control settings, generally with valúes ranging from 
0-127; switches (controls having an ON or OFF State with no intermedí¬ 
ate settings); and data controllers, which enter data either through numer- 
ical keypads or stepped up/down entry buttons. 

Explanation of controller ID parameters 

As you can see in Figure 9.19, the 2nd byte of the Control Change message is 
used to denote the controller ID number. This all-important valué is used to 
specify which of the device's program or performance parameters are to be 
addressed. 

The following section details the general categories and conventions for assign- 
ing controller ID numbers to an associated parameter (as specified by the 1995 
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update of the MMA (MIDI Manufacturers Association, www.midi.org). An Over¬ 
View of these controllers can be seen in Table 9.1. This is definitely an important 
table to earmark, because these numbers will be an important guide toward 
knowing and/or finding the right ID number that can help you on your path 
toward finding that perfect variable for making it sound right. 

System messages 

As the ñame implies, System messages are globally transmitted to every MIDI 
device in the MIDI chain. This is accomplished because MIDI channel numbers 
aren't addressed within the byte structure of a System message. Thus, any device 
will respond to these messages, regardless of its MIDI channel assignment. The 
three System message types are System-Common messages, System Real-Time 
messages, and System-Exclusive messages. 

System-Common messages are used to transmit MIDI timecode, song position 
pointer, song select, tune request and end-of-exclusive data messages through- 
out the MIDI system or 16 channels of a specified MIDI port. 

■ MIDI timecode (MTC) messages: Provide a cost-effective and easily imple- 
mented way to transíate SMPTE (a standardized synchronization time¬ 
code) into an equivalent code that conforms to the MIDI 1.0 spec. It 
allows time-based codes and commands to be distributed throughout the 
MIDI chain in a cheap, stable and easy-to-implement way. MTC Quarter- 
Frame messages are transmitted and recognized by MIDI devices that can 
understand and execute MTC commands. A grouping of eight quarter 
frames is used to denote a complete timecode address (in hours, minutes, 
seconds, and frames), allowing the SMPTE address to be updated every 
two frames. More in-depth coverage of MIDI timecode can be found in 
Chapter 11. 

■ Song Position Pointer (SPP) messages: Allow a sequencer or drum machine 
to be synchronized to an external source (such as a tape machine) from 
any measure position within a song. This complex timing protocol isn't 
commonly used, because most users and design layouts currently favor 
MTC. 

■ Song Select messages: Use an identifying song ID number to request a 
specific song from a sequence or controller source. After being selected, 
the song responds to MIDI Start, Stop and Continué messages. 

■ Tune Request messages: Used to request that an equipped MIDI instrument 
initiate its internal tuning routine. 

■ End of Exclusive (EOX) messages: Indicate the end of a System-Exclusive 
message. 

System Real-Time messages provide the precise timing element required to 
synchronize all of the MIDI devices in a connected system. To avoid timing 
delays, the MIDI specification allows System Real-Time messages to be inserted 
at any point in the data stream, even between other MIDI messages. 
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Table 9.1 

Listing of Controller ID Numbers, Outlining Both the Defined 
Format and Conventional Controller Assignments 

Control # 

Parameter 

14-Bit Controllers Coarse/MSB (Most Significant Bit) 

0 

Bank Select 0-127 MSB 

1 

Modulation Wheel or Lever 0-127 MSB 

2 

Breath Controller 0-127 MSB 

3 

Undefined 0-127 MSB 

4 

Foot Controller 0-127 MSB 

5 

Portamento Time 0-127 MSB 

6 

Data Entry MSB 0-127 MSB 

7 

Channel Volume (formerly Main Volume) 0-127 MSB 

8 

Balance 0-127 MSB 

9 

Undefined 0-127 MSB 

10 

Pan 0-127 MSB 

11 

Expression Controller 0-127 MSB 

12 

Effect Control 1 0-127 MSB 

13 

Effect Control 2 0-127 MSB 

14 

Undefined 0-127 MSB 

15 

Undefined 0-127 MSB 

16-19 

General Purpose Controllers 1-4 0-127 MSB 

20-31 

Undefined 0-127 MSB 

14-Bit Controllers Fine/LSB (Least Significant Bit) 

32 

LSB for Control 0 (Bank Select) 0-127 LSB 

33 

LSB for Control 1 (Modulation Wheel or Lever) 0-127 LSB 

34 

LSB for Control 2 (Breath Controller) 0-127 LSB 

35 

LSB for Control 3 (Undefined) 0-127 LSB 

36 

LSB for Control 4 (Foot Controller) 0-127 LSB 

37 

LSB for Control 5 (Portamento Time) 0-127 LSB 

38 

LSB for Control 6 (Data Entry) 0-127 LSB 

39 

LSB for Control 7 (Channel Volume, formerly Main Volume) 
0-127 LSB 


Continued ... 
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Table 9.1 

continued 

Control # 

Parameter 

40 

LSB for Control 8 (Balance) 0-127 LSB 

41 

LSB for Control 9 (Undefined) 0-127 LSB 

42 

LSB for Control 10 (Pan) 0-127 LSB 

43 

LSB for Control 11 (Expression Controller) 0-127 LSB 

44 

LSB for Control 12 (Effect control 1) 0-127 LSB 

45 

LSB for Control 13 (Effect control 2) 0-127 LSB 

46-47 

LSB for Control 14-15 (Undefined) 0-127 LSB 

48-51 

LSB for Control 16-19 (General Purpose Controllers 1-4) 

0-127 LSB 

52-63 

LSB for Control 20-31 (Undefined) 0-127 LSB 

7-Bit Controllers 

64 

Damper Pedal on/off (Sustain) <63 off, >64 on — 

65 

Portamento On/Off <63 off, >64 on — 

66 

Sustenuto On/Off <63 off, >64 on — 

67 

Soft Pedal On/Off <63 off, >64 on — 

68 

Legato Footswitch <63 Normal, >64 Legato — 

69 

Hold 2 <63 off, >64 on — 

70 

Sound Controller 1 (default: Sound Variation) 0-127 LSB 

71 

Sound Controller 2 (default: Timbre/Harmonic Intens.) 0-127 
LSB 

72 

Sound Controller 3 (default: Release Time) 0-127 LSB 

73 

Sound Controller 4 (default: Attack Time) 0-127 LSB 

74 

Sound Controller 5 (default: Brightness) 0-127 LSB 

75 

Sound Controller 6 (default: Decay Time: see MMA RP-021) 
0-127 LSB 

76 

Sound Controller 7 (default: Vibrato Rate: see MMA RP-021) 
0-127 LSB 

77 

Sound Controller 8 (default: Vibrato Depth: see MMA RP-021) 
0-127 LSB 

78 

Sound Controller 9 (default: Vibrato Delay: see MMA RP-021) 
0-127 LSB 


Continued ... 
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Table 9.1 

continued 

Control # 

Parameter 

79 

Sound Controller 10 (default undefined: see MMA RP-021) 
0-127 LSB 

80-83 

General Purpose Controller 5-8 0-127 LSB 

84 

Portamento Control 0-127 LSB 

85-90 

Undefined —— 

91 

Effects 1 Depth (default: Reverb Send Level) 0-127 LSB 

92 

Effects 2 Depth (default: Tremolo Level) 0-127 LSB 

93 

Effects 3 Depth (default: Chorus Send Level) 0-127 LSB 

94 

Effects 4 Depth (default: Celesta [Detune] Depth) 0-127 LSB 

95 

Effects 5 Depth (default: Phaser Depth) 0-127 LSB 

Parameter Valué Controllers 

96 

Data Increment (Data Entry +1) 

97 

Data Decrement (Data Entry -1) 

98 

Non-Registered Parameter Number (NRPN): LSB 0-127 LSB 

99 

Non-Registered Parameter Number (NRPN): MSB 0-127 MSB 

100 

Registered Parameter Number (RPN): LSB* 0-127 LSB 

101 

Registered Parameter Number (RPN): MSB* 0-127 MSB 

102-119 

Undefined- 

Reserved for Channel Mode Messages 

120 

All Sound Off 0 — 

121 

Reset All Controllers 

122 

Local Control On/Off 0 off, 127 on 

123 

All Notes Off 

124 

Omni Mode Off (+ all notes off) 

125 

Omni Mode On (+ all notes off) 

126 

Poly Mode On/Off (+ all notes off) 

127 

Poly Mode On (+ mono off +all notes off) 
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■ Timing Clock messages: The MIDI Timing Clock message is transmitted 
within the MIDI datastream at various resolution rates. It is used to syn- 
chronize the internal timing clocks of each MIDI device within the system 
and is transmitted in both the start and stop modes at the currently 
defined tempo rate. In the early days of MIDI, these rates (which are mea- 
sured in pulses per quarter note [ppq]) ranged from 24 to 128 ppq; 
however, continued advances in technology have brought these rates up 
to 240, 480 or even 960 ppq. 

■ Start messages: lipón receipt of a timing clock message, the MIDI Start 
command instructs all connected MIDI devices to begin playing from their 
internal sequences' initial start point. Should a program be in midse- 
quence, the start command will reposition the sequence back to its begin- 
ning, at which point it will begin to play. 

■ Stop messages: lipón receipt of a MIDI Stop command, all devices within 
the system will stop playing at their current position point. 

■ Continué messages: After receiving a MIDI Stop command, a MIDI Con¬ 
tinué message will instruct all connected devices to resume playing their 
internal sequences from the precise point at which it was stopped. 

■ Active Sensing messages: When in the Stop mode, an optional Active 
Sensing message can be transmitted throughout the MIDI datastream 
every 300 milliseconds. This instructs devices that can recognize this 
message that they're still connected to an active MIDI data stream. 

■ System Reset messages: A System Reset message is manually transmitted 
in order to reset a MIDI device or instrument back to its initial power- 
up default settings (commonly mode 1, local control on and all notes 
off). 

System-exclusive messages 

The System-Exclusive (Sys-Ex) message allows MIDI manufacturers, program- 
mers and designers to communicate customized MIDI messages between MIDI 
devices. The purpose of these messages is to give manufacturers, programmers 
and designers the freedom to communicate any device-specific data of an unre- 
stricted length, as they see fit. Most commonly, sys-ex data are used for the bulk 
transmission and reception of program/patch data and sample data, as well as 
real-time control over a device's parameters. The transmission format of a sys-ex 
message (Figure 9.28), as defined by the MIDI standard, ineludes a sys-ex status 
header, manufacturéis ID number, any number of sys-ex data bytes and an EOX 
byte. When a sys-ex message is received, the identification number is read by a 
MIDI device to determine whether or not the following messages are relevant. 
This is easily accomplished by the assignment of a unique 1- or 3-byte ID 
number to each registered MIDI manufacturer and make. If this number doesnT 
match the receiving MIDI device, the subsequent data bytes will be ignored. 
Once a valid stream of sys-ex data has been transmitted, a final EOX message 
is sent, after which the device will again begin to respond normally to incoming 
MIDI performance messages. 
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FIGURE 9.28 

System-exclusive ID 
data and controller 
format. 



SysEx Status Manufacturéis ID 


* 



(1111 0000) (ODDO DDDD) 


~i 


{undefined number of data byt.es) 


i 


{1111 0111 } 

End of Exclusive (EOX) 


In actual practice, the general idea behind sys-ex is that it uses MIDI messages 
to transmit and receive program, patch and sample data or real-time parameter 
information between devices. It's sort of like having an instrument or devi ce 
that's a musical chameleon. One moment it can be configured with a certain 
set of sound patches and setup data and then, after it receives a new sys-ex data 
dump, you could easily end up with an instrument that's literally full of new 
and exciting (or not-so-exciting) sounds and settings. Here are a few examples 
of how sys-ex can be put to good use: 

■ Transmitting patch data between synths: Sys-ex can be used to transmit patch 
and overall setup data between synths of identical make and (most often) 
model. Let's say that we have a Brand X Model Z synthesizer, and, as it 
turns out, you have a buddy across town that also has a Brand X Model 
Z. That's cool, except your buddy has a completely different set of sound 
patches in her synth ... and you want them! Sys-ex to the rescue! All 
you need to do is go over and transfer your buddy's patch data into your 
synth (to make life easier, make sure you take the instruction manual 
along). 

■ Backing up your current patch data: This can be done by transmitting a sys-ex 
dump of your synth's entire patch and setup data to disk, to a sys-ex utility 
program (often Shareware) or to your DAW/MIDI sequencer. This is 
important: Back up your factory preset or current patch data before attempting 
a sys-ex dump! If you forget and download a sys-ex dump, your previous 
settings will be lost until you contact the manufacturen download the 
dump from their website or take your synth back to your favorite music 
store to reload the data. 

■ Getting patch data from the Web: One of the biggest repositories of sys-ex 
data is the Internet. To surf the Web for sys-ex patch data, all you need to 
do is log on to your favorite search engine site and enter the ñame of your 
synth. You'll probably be amazed at how many hits will come across the 
screen, many of which are chock-full of sys-ex dumps that can be down- 
loaded into your synth. 

■ Varying sys-ex controller or patch data in real time: Patch editors or hardware 
MIDI controllers can be used to vary system and sound-generating param- 
eters, in real time. Both of these controller types can ease the job of 
experimenting with parameter valúes or changing mix moves by giving 
you physical or on-screen Controls that are often more intuitive and easier 
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to deal with than programming electronic instruments that'll often leave 
you dangling in cursor and 3-inch LCD screen hell. 

Before moving on, I should also point out that sys-ex data grabbed from the 
Web, disk, disc or any other médium will often be encoded using several sys-ex 
file format styles (unfortunately, none of which are standardized). Unfortu- 
nately, sequencer Y might not recognize a sys-ex dump that was encoded using 
sequencer Z. For this reason, dumps are often encoded using easily available, 
standard sys-ex utility programs for the Mac or PC. 

At last, it seems that a single unified standard has begun to emerge from the 
fray that's so simple that it's amazing it wasn't universally adopted from the 
start. This system simply records a sys-ex dump as data on a single MIDI track 
file. Before recording a dump to a sequencer track, you may need to consult the 
manual to make sure that sys-ex filtering is turned off. Once this is done, simply 
place the track into record mode, initiate the dump and save the track in an 
all-important sys-ex dump directory. Using this approach, it would also be pos- 
sible to: 

■ Import the appropriate sys-ex dump track (or set of tracks) into the current 
working session so as to automatically program the instruments before 
the sequence is played back. 

■ Import the appropriate sys-ex dump track (or set of tracks) into sepárate 
MIDI tracks that can be muted or unassigned. Should the need arise, the 
track(s) can be activated and/or assigned in order to dump the data into 
the appropriate instruments. 


MIDI AND THE PERSONAL COMPUTER 

Besides the coveted place of honor in which most electronic musicians hold 
their instruments, the most important device in a MIDI system is undoubtedly 
the personal Computer. Through the use of software programs and peripheral 
hardware, the Computer is often used to control, process and distribute informa- 
tion relating to music performance and production from a centralized, inte- 
grated control position. 

Of course, two Computer types dominate modern-day music production: the 
PC and the Mac. In truth, each brings its own particular set of advantages 
and disadvantages to personal computing, although their differences have 
greatly dwindled over the years. My personal take on the matter (a subject 
that's not even important enough to debate) is that it's a dual-platform world. 
The choice is yours and yours alone to make. ... Many professional software 
and hardware Systems can work on either platform. As I write this, some of my 
music collaborators are fully Mac, some are PC and some (like me) use both, 
and it doesn't affect our production styles at all. Coexistence isn't much of a 
problem, either. Living a dual-platform existence can give you the edge of 
being familiar with both systems, which can be downright handy in a sticky 
production pinch. 


MIDI and the Personal Computer 


Connecting to the peripheral world 

An important event in the evolution of personal computing has been the 
maturation of hardware and processing peripherals. With the development of 
the USB (www.usb.org) and FireWire (www.1394ta.org) protocols, hardware 
devices such as mice, keyboards, cameras, audio interfaces, MIDI interfaces, CD 
and hard drives, MP3 players and even portable fans can be plugged into an 
available port without any need to change frustrating hardware settings or open 
up the box. External peripherals are generally hardware devices that are designed 
to do a specific task or range of production tasks. For example, an audio inter¬ 
face is capable of translating analog audio (and often MIDI, control and other 
media) into digital data that can be understood by the Computer. Other periph¬ 
eral devices can perform such useful functions as printing, media interfacing 
(video and MIDI), scanning, memory card interfacing, portable hard disk 
storage ... the list could literally fill pages. 

The MIDI interface 

Although computers and electronic instruments both communicate using the 
digital language of l's and O's, computers simply can't understand the language 
of MIDI without the use of a device that translates the serial messages into a 
data structure that computers can comprehend. Such a device is known as the 
MIDI interface. A wide range of MIDI interfaces currently exist that can be used 
with most Computer system and OS platforms. For the casual and professional 
musician, interfacing MIDI into a production system can be done in a number 
of ways. Probably the most common way to access MIDI In, Out and Thru jacks 
is on modern-day USB or FireWire audio interface or controller surface (Figures 

9.29) . It's become a common matter for portable devices to offer 16 channels 
of 1/0 (on one port), while multichannel interfaces often inelude múltiple MIDI 
1/0 ports that can give you access to 32 or more channels. 

The next option is to choose a USB MIDI interface that can range from simpler 
devices that inelude a single port to multiple-port Systems that can easily handle 
up to 128 channels over eight 1/0 ports. The multiport MIDI interface (Figure 

9.30) is often the device of choice for most professional electronic musicians 
who require added routing and synchronization capabilities. These rack- 
mountable USB devices can be used to provide eight independent MIDI Ins and 


FIGURE 9.29 

Most audio interface 
devices inelude MIDI 
1/0 ports. (Courtesy 
of Tascam, www. 
tascam.com.) 
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FIGURE 9.30 

M-Audio MIDISPORT 
8x8/s MIDI interface. 
(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 


Outs to easily distribute MIDI data through sepárate lines over a connected 
network. 

In addition to distributing MIDI data, these Systems often inelude driver soft¬ 
ware that can route and process MIDI data throughout the MIDI network. For 
example, a multiport interface could be used to merge together several MIDI 
Ins (or Outs) into a single datastream, filter out specific MIDI message types 
(used to block out unwanted commands that might adversely change an instru¬ 
menté sound or performance) or rechannel data being transmitted on one 
MIDI channel or port to another channel or port (thereby allowing the data to 
be recognized by an instrument or devi ce). 

Another important function that can be handled by most multiport interfaces 
is synchronization. Synchronization (sync, for short) allows other, external 
devices (such as DAWs, video decks and other media systems) to be simultane- 
ously played back using the same timing reference. Interfaces that ineludes sync 
features will often read and write SMPTE timecode, convert SMPTE to MIDI 
timecode (MTC) and allow recorded timecode signáis to be cleaned up when 
copying code from one analog device to another (jam sync). Further reading 
on synchronization can be found in Chapter 11. 

In addition to the above interface types, a number of MIDI keyboard controllers 
and synth instruments have been designed with MIDI ports and jacks built right 
into them. For those getting started, this useful and cost-saving feature makes 
it easy to intégrate your existing instruments into your DAW and sequencing 
environment. 

ELECTRONIC INSTRUMENTS 

Since their inception in the early 1980s, MIDI-based electronic instruments have 
played a central and important role in the development of music technology 
and production. These devices (which fall into almost every instrument cate- 
gory), along with the advent of cost-effective analog and digital audio recording 
systems, have probably been the most important technological advances to 
shape the industry into what it is today. In fact, the combination of hardware 
and newer software plug-in technologies has turned the personal project 
studio into one of the most important driving forces behind modern-day music 
production. 



Electronic Instruments 


Inside the toys 

Although electronic instruments often differ from one another in looks, form 
and function, they almost always share a common set of basic building block 
components (Figure 9.31), including the following: 

■ Central Processing units (CPUs): CPUs are one or more dedicated comput- 
ing devices (often in the form of a specially manufactured microprocessor 
chip) that contain all of the necessary instructional brains to control the 
hardware, voice data and sound-generating capabilities of the entire 
instrument or device. 

■ Performance controllers: These inelude such interface devices as music key- 
boards, knobs, buttons, drum pads and/or wind controllers for inputting 
performance data directly into the electronic instrument in real time or 
for transforming a performance into MIDI messages. Not all instruments 
have a built-in controller. These devices (commonly known as modules) 
contain all the necessary processing and sound-generating circuitry; 
however, the idea is to save space in a cramped studio by eliminating 
redundant keyboards or other controller surfaces. 

■ Control panel: The control panel is the all-important human interface of 
data entry Controls and display panels that let you select and edit sounds 
and route and mix output signáis, as well as control the instrumenté basic 
operating functions. 

■ Memory: Digital memory is used for storing important internal data (such 
as patch information, setup configurations and/or digital waveform data). 
This digital data can be encoded in the form of either read-only memory 
(ROM; data that can only be retrieved from a factory-encoded chip, car- 
tridge, or CD/DVD-ROM) or random access memory (RAM; memory that 
can be read from and stored to a device's resident memory, cartridge, hard 
disk or recordable media). 

■ Voice circuitry: Depending on the device type, this section can chain 
together digital processing "blocks" to either generate sounds (voices) or 


FIGURE 9.31 

The basic building 
blocks of an 
electronic musical 
instrument. 
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process and reproduce digital samples that are recorded into memory for 
playback according to a specific set of parameters. In short, it's used to 
generate or reproduce a sound patch, which can then be processed, ampli- 
fied and heard via speakers or headphones. 

■ Auxiliary controllers: These are external controlling devices that can be used 
in conjunction with an instrument or controller. Examples of these inelude 
foot pedáis (providing continuous-controller data), breath controllers, 
and pitch-bend or modulation wheels. Some of these controllers are con- 
tinuous in nature, while others exist as a switching function that can be 
turned on and off. Examples of the latter inelude sustain pedáis and 
vibrato switches. 

■ MIDI Communications ports: These data ports and physical jacks are used 
to transmit and/or receive MIDI data. 

Generally, no direct link is made between each of these functional blocks; the 
data from each of these components is routed and processed through the instru¬ 
menté CPU. For example, should you wish to select a certain sound patch from 
the instrumenté control panel, the control panel could be used to instruct the 
CPU to recall all of the waveform and sound-patch parameters from memory 
that are associated with the particular sound. These instructional parameters 
would then be used to modify the internal voice circuitry, so that when a key 
on the keyboard is pressed or a MIDI Note-On message is received, the sound 
generators will output the desired patch's note and level valúes. 

Instrument plug-ins 

In recent years, an almost staggering range of software instruments has come 
onto the market as instrument plug-ins. These Systems, which inelude all known 
types of synths, samplers and pitch- and sound-altering devices, are able to 
communicate MIDI, audio, timing sync and control data between the software 
instrument (or effect plug-in) and a host DAW program/CPU processor. 

Using an established plug-in Communications protocol, it's possible for most 
or all of the audio and timing data to be routed through the host audio appli- 
cation, allowing the instrument or application I/O, timing and control param¬ 
eters to be seamlessly integrated into the DAW or application. A few of these 
protocols inelude: 

■ Steinberg's VST 

■ Digidesign's RTAS 

■ MOTU's MAS. 

Propellerhead's ReWire is another type of protocol that allows audio, perfor¬ 
mance and control data of an independent audio program to be wired into a 
host program (usually a DAW) such that the audio routing and sync timing of 
the slave program is locked to the host DAW, effectively allowing them to work 
in tándem as a single production environment. Further reading on plug-in pro¬ 
tocols and virtual and plug-in instruments can be found in Chapters 7 and 8. 


Electronic Instruments 


For the remainder of this section, we'll be discussing the various types of MIDI 
instruments and controller devices that are currently available on the market. 
These instruments can be grouped into such categories as keyboards, percus- 
sion, controlling devices, MIDI guitars and strings and sequencers. 

Keyboards 

By far, the most common instruments that you'll encounter in almost any MIDI 
production facility will probably belong to the keyboard family. This is due, in 
part, to the fact that keyboards were the first electronic music devices to gain 
wide acceptance; also, MIDI was initially developed to record and control many 
of their performance and control parameters. The two basic keyboard-based 
instruments are the synthesizer and the digital sampler. 

THE 5YNTH 

A synthesizer (or synth) is an electronic instrument that uses múltiple sound 
generators, filters and oscillator blocks to create complex waveforms that can 
be combined into countless sonic variations. These synthesized sounds have 
become a basic staple of modern music and range from those that sound 
"cheesy" to ones that realistically mimic traditional instruments ... and all the 
way to those that generate otherworldly, ethereal sounds that literally defy clas- 
sification. 

Synthesizers generate sounds using a number of different technologies or 
program algorithms. Examples of these inelude: 

■ FM synthesis: This technique generally makes use of at least two signal 
generators (commonly referred to as "operators") to create and modify a 
voice. It often does this by generating a signal that modulates or changes 
the tonal and amplitude characteristics of a base carrier signal. More 
sophisticated FM synths use up to four or six operators per voice, each 
using filters and variable amplifier types to alter a signal's characteristics. 

■ Wavetable synthesis: This technique works by storing small segments of 
digitally sampled sound into a memory media. Various sample-based and 
synthesis techniques make use of looping, mathematical interpolation, 
pitch shifting and digital filtering to create extended and richly textured 
sounds that use a surprisingly small amount of sample memory, allowing 
hundreds if not thousands of samples and sound variations to be stored 
in a single device or program. 

■ Additive synthesis: This technique makes use of combined waveforms that 
are generated, mixed and varied in level over time to create new timbres 
that are composed of múltiple and complex harmonics, which vary over 
time. Subtractive synthesis makes extensive use of filtering to alter and 
subtract overtones from a generated waveform (or series of waveforms). 

Of course, synths come in all shapes and sizes and use a wide range of patented 
synthesis techniques for generating and shaping complex waveforms, in a poly- 
phonic fashion using 16, 32 or even 64 simultaneous voices (Figures 9.32 and 
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FIGURE 9.32 

A6 Andrómeda 
analog-modeled 
digital synthesizer. 
(Courtesy of Alesis, 
www.alesis.com.) 
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FIGURE 9.33 

Steinberg Xphraze 
synth plug-in. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


9.33). In addition, many synths often inelude a percussion section that can play 
a full range of drum and "perc" sounds, in a number of styles. Reverb and other 
basic effeets are also commonly built into the architecture of these devices, 
reducing the need for using extensive outboard effeets when being played on- 
stage or out of the box. Speaking of "out of the box," a number of synth systems 
are referred to as being "workstations." Such beasties are designed (at least in 
theory) to handle many of your basic production needs (including basic sound 
generation, MIDI sequencing, effeets, etc.)—all in one neat little package. 

SAMPLERS 

A sampler (Figures 9.34 through 9.36) is a device that can convert audio into 
a digital form that is then imported into internal random access memory 
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FIGURE 9.34 

Akai MPC1000 Music 
Production Center. 
(Courtesy of Akai 
Professional, www. 
akaipro.com.) 



FIGURE 9.35 

HALion 3 software 
sampling plug-in. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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(RAM). Once audio has been sampled or loaded into RAM (from disk, disc or 
diskette), segments of sampled audio can then be edited, transposed, processed 
and played in a polyphonic, musical fashion. In short, a sampler can be thought 
of as a digital audio memory device that lets you record, edit and reload samples 
into RAM. Once loaded, these sounds (whose length and complexity are often 
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FIGURE 9.36 

Reason’s NN-19 and 
NN-XT sample 
modules. (Courtesy 
of Propellerhead 
Software, www. 
propellerheads.se.) 


only limited by memory size and your imagination) can be looped, modulated, 
filtered and amplified (according to user or factory setup parameters) in a way 
that allows the waveshapes and envelopes to be modified. Signal processing 
capabilities, such as basic editing, looping, gain changing, reverse, sample-rate 
conversión, pitch change and digital mixing can also be easily applied to change 
the sounds in an almost infinite number of ways. 

Samplers derive their sounds from recorded and/or imported audio data stored 
as digital audio data within a personal Computer or other media storage device. 
LJsing its digital signal processing (DSP) capabilities, most software samplers 
are able to store and access samples within the internal memory to: 

■ Import previously recorded sound files (often in .wav, .aif and other 
common formats) 

■ Edit and loop sounds into a usable form 

■ Vary envelope parameters (e.g., dynamics over time) 

■ Vary processing parameters 

■ Save the edited sample performance setup as a file for later recall. 

A sample can be played according to the standard Western musical scale (or any 
other scale, for that matter) by altering the playback sample rate over the con- 
troller's note range. For example, pressing a low-pitched key on the keyboard 
will cause the sample to be played back at a lower sample rate, while pressing 
a high-pitched one will cause the sample to be played back at rates that would 
put Mickey Mouse to shame. By choosing the proper sample-rate ratios, these 
sounds can be polyphonically played (whereby múltiple notes are sounded at 
once) at pitches that correspond to standard musical chords and intervals. 
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FIGURE 9.37 

Example of a 
sampler’s keyboard 
layout that has been 
programmed to 
inelude zones. Notice 
that the upper 
register has been 
split into several 
zones that are 
triggered by varying 
velocities. 


hard grand piano loud honky piaría 



upright bass soft grand piano soft honky piano 

JL A A 



A sampler (or synth) with a specific number of voices (e.g., 64 voices) simply 
means that up to 64 notes can be simultaneously played on a keyboard at any 
one time. Each sample in a multiple-voice system can be assigned across a 
performance keyboard, using a process known as splitting or mapping. In this 
way, a sound can be assigned to play across the performance surface of a con¬ 
troller over a range of notes, known as a zone (Figure 9.37). In addition to 
grouping samples into various zones, velocity can enter into the equation by 
allowing múltiple samples to be layered across the same keys of a controller, 
according to how soft or hard they are played. For example, a single key might 
be layered so that pressing the key lightly would reproduce a softly recorded 
sample, while pressing it harder would produce a louder sample with a sharp, 
percussive attack. In this way, mapping can be used to create a more realistic 
instrument or wild set of soundscapes that change not only with the played 
keys but with different velocities as well. Most samplers have extensive edit 
capabilities that allow the sounds to be modified in much the same way as a 
synthesizer, using such modifiers as: 

■ Velocity 

■ Panning 

■ Expression (modulation and user control variations) 

■ Fow-frequency oscillation (FFO) 

■ Attack, delay, sustain and release (ADSR) and other envelope processing 
parameters 

■ Keyboard scaling 

■ Aftertouch. 

Many sampling systems will often inelude such features as integrated signal 
Processing, múltiple outputs (offering isolated channel outputs for added live 
mixing and signal processing power or for recording individual voices to a 
multitrack recording system) and integrated MIDI sequencing capabilities. 

Sample CDs, DVDs and the Web 

Just as patch data in the form of Sys-Ex dump files can have the effect of breath- 
ing new life into your synth, a wide range of free or commercially available 
samples is commonly available off the Web, from a website, or from a business 
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entity that lets you experiment with loading new and fresh sounds into your 
production system. These files can exist as unedited sound file data (which can 
be imported into any sample system or DAW track), or as data that has been 
specifically programmed by a professional musician/programmer to contain all 
the necessary loops, system commands and sound-generating parameters, so 
that all you ideally need to do is load the sample and start having fun. 

The mind boggles at the range of styles and production quality that has gone 
into producing samples that are just ready and waiting to give your project a 
boost. The general production level literally runs the entire amateur-to-pro 
gamut—meaning that, whenever possible, it's wise to listen to examples to 
determine their quality and to hear how they might fit into your own personal 
or project style before you buy. As a final caveat ... by now, you've probably 
heard of the legal battles that have been raging over sampled passages that have 
been "ripped" from recordings of established artists. In the fall of 2004, in the 
case of Bridgeport Music et al v. Dimensión Films, the 6th Circuit U.S. Court of 
Appeals ruled that the digital sampling of a recording without a license is a 
violation of copyright, regardless of size or significance. This points to the need 
for tender loving care when lifting samples off a CD or the Web. 


The MIDI keyboard controller 

As computers, sound modules, virtual software instruments and other types of 
digital devices have come onto the production scene, it's been interesting to 
note that fewer and fewer devices are being made that inelude a music keyboard 
in their design. As a result, the MIDI keyboard controller has gained in popular- 
ity as a device that might inelude a: 

■ Music keyboard surface 

■ Variable parameter Controls 

■ Fader, mixing and transport Controls 

■ Switching Controls 

■ Tactile trigger and control surfaces. 


A Word About Controllers 


A MIDI controller ¡s a device that’s expressly designed 
to control other devices (be they for sound, light or 
mechanical control) within a connected MIDI system. 

As was previously mentioned, these devices contain no 
intemal tone generators or sound-producing elements 
but often inelude a high-quality control surface and a 
wide range of Controls for handling control, trigger 


and device-switching events. Since controllers have 
become an integral part of music production and are 
available in many incarnations to control and emulate 
many types of musical instrument types, don’t be 
surprised to find controllers of various incarnations 
popping up all over this book and within electronic music 
production. 
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As was stated, these devices contain no internal tone generators or sound- 
producing elements. Instead they can be used in the studio or on the road as a 
simple and straightforward surface for handling MIDI performance, control and 
device-switching events in real time. 

As you might imagine, controllers vary widely in the number of features that 
are offered (Figures 9.38 and 9.39). For starters, the number of keys can vary 
from the sporty, portable 25-key models to those having 49 and 61 keys and all 


FIGURE 9.38 

Novation 25SL USB 
MIDI Controller. 
(Courtesy of Novation 
Digital Music 
Systems, Ltd., www. 
novationmusic.com.) 



FIGURE 9.39 

Yamaha DS6IV PRO 
Disclavier MIDI piano. 
(Courtesy of Yamaha 
Corporation of 
America, www. 
yamaha.com.) 
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the way to the full 88-key models that can play the entire range of a full-size 
grand piano. The keys may be fully or partially weighted and in a number of 
models the keys might be much smaller than the full piano key size—often 
making a performance a bit difficult. Beyond the standard pitch and modula- 
tion wheels (or similar-type controller), the number of options and general 
features is up to the manufacturers. With the increased need for control over 
electronic instruments and music production systems, many model types offer 
up a wide range of physical controllers for varying an ever-widening range of 
expressive parameters. 


Percussion 

Historically speaking, one of the first applications of sample technology made 
use of digital audio to record and play back drum and percussion sounds. Out 
of this virtual miracle sprang a major class of sample and synthesis technology 
that lets an artist (mostly keyboard players) add drum and percussion to their 
own compositions with relative ease. Over the years, MIDI has brought sampled 
percussion within the grasp of every electronic musician, whatever skill level, 
from the frustrated drummer to professional percussionist/programmers—all 
of whom use their skills to perform live and/or to sequenced compositions. 

THE DRUM MACHINE 

In its most basic form, the drum machine uses ROM-based, prerecorded wave- 
form samples to reproduce high-quality drum sounds from its internal memory. 
These factory-loaded sounds often inelude a wide assortment of drum sets, 
percussion sets and rare, wacky percussion hits, and effected drum sets (e.g., 
reverberated, gated). Who knows—you might even encounter scream hits by 
the venerable King of Soul, James Brown. These prerecorded samples can be 
assigned to a series of playable keypads that are generally located on the 
machines top face, providing a straightforward controller surface that often 
sports velocity and aftertouch dynamics. Sampled voices can be assigned to each 
pad and edited using control parameters such as tuning, level, output assign- 
ment and panning position. 

Because of new cost-effective technology, many drum machines now inelude 
basic sampling technology, which allows sounds to be imported, edited and 
triggered directly from the box (Figure 9.40). As with the traditional "beat box," 
these samples can be easily mapped and played from the traditionally styled 
surface trigger pads. Of course, virtual software drum and groove machines 
(Figure 9.41) are part of the present-day landscape and can be used in a stand- 
alone, plugged-in or rewired production environment. 

MIDI DRUM CONTROLLERS 

MIDI drum controllers are used to transíate the voicing and expressiveness of 
a percussion performance into MIDI data. These devices are great for capturing 
the feel of a live performance, while giving you the flexibility of automating or 
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FIGURE 9.40 

Alesis SR-18 stereo 
drum machine. 
(Courtesy of Alesis, 
www.alesis.com.) 



FIGURE 9.41 

BFD acoustic drum 
library module. 
(Courtesy of 
FXpansion, www. 
fxpansioncom.) 
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sequencing a live event. These devices range from having larger pads and trigger 
points on a larger performance surface to drum-machine-type pads/buttons. 
Corning under the "Don't try this at home" category, these controller pads are 
generally too small and not durable enough to withstand drumsticks or mallets. 
For this reason, they're generally played with the fingers. It's long been a popular 
misconception that MIDI drum controllers have to be expensive. This simply 
isn't true. There are quite a few instruments that are perceived by many to be 
toys but, in fact, are fully implemented with MIDI and can be easily used as a 
controller. A few of the ways to perform and sequence percussion inelude: 
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FIGURE 9.42 

Trigger Finger 
16-pad MIDI drum 
control surface. 
(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 


■ Drum machine button pads: One of the most straightforward of all drum 
controllers is the drum button pad design that's built into most drum 
machines, portable percussion controllers (Figure 9.42) and certain key- 
board controllers. By calling up the desired setup and voice parameters, 
these small footprint triggers let you go about the business of using your 
fingers to do the walking through a performance or sequenced track. 

■ The keyboard as a percussion controller: Since drum machines respond to 
external MIDI data, probably the most commonly used device for trigger- 
ing percussion and drum voices is a standard MIDI keyboard controller. 
One advantage of playing percussion sounds from a keyboard is that 
sounds can be triggered more quickly because the playing surface is 
designed for fast finger movements and doesn't require full hand/wrist 
motions. Another advantage is its ability to express velocity over the entire 
range of possible valúes (0-127), instead of the limited number of veloc¬ 
ity steps that are available on certain drum pad models. 

■ Drum pad controllers: In more advanced MIDI project studios or live stage 
rigs, it's often necessary for a percussionist to have access to a playing 
surface that can be played like a real instrument. In these situations, a 
dedicated drum pad controller would be better for the job. Drum control¬ 
lers vary widely in design. They can be built into a single, semiportable 
case, often having between six and eight playing pads, or the trigger pads 
can be individual pads that can be fitted onto a special rack, traditional 
drum floor stand or drum set. 

■ MIDI drums: Another way to MIDI-fy an acoustic drum is through the use 
of trigger technology. Put simply, triggering is carried out by using a trans- 
ducer pickup (such as a mic or contract pickup) to change the acoustic 
energy of a percussion or drum instrument into an electrical voltage. Using 
a MIDI trigger device (Figure 9.43), a number of pickup inputs can be 
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FIGURE 9.43 

By using a MIDI 
trigger device (such 
as an Alesis DM5), a 
pickup can be either 
directly replaced or 
sent via MIDI to 
another device or 
sequenced track (a) 
using a mic/ 
instrument source or 
(b) using a recorded 
track as a source. 
(Courtesy of Alesis, 
www.alesis.com.) 




translated into MIDI so as to trigger programmed sounds or samples from 
an instrument for use on stage or in the studio. 

OTHER MIDI INSTRUMENT AND CONTROLLER TYPES 

There are literally tons of instruments and controller types out there that are 
capable of translating a performance or general body movements into MIDI. 
You'd be surprised what you'll find searching the Web for wild and wacky 
controllers—both those that are commercially available and those that are made 
by soldering iron junkies. A few of the more traditional controllers inelude MIDI 
guitars and basses, wind controllers, MIDI vibraphones ... the list goes on. 

SEQUENCING 

Apart from our computers, DAWs and venerable electronic instruments, one of 
the most important tools that can be found in the modern-day project studio 
is the MIDI sequencer. Basically, a sequencer is a digital device or software 
application that's used to record, edit and output MIDI messages in a sequential 
fashion. These messages are generally arranged in a track-based format that 
follows the modern production concept of having instruments (and/or instru¬ 
ment voices) located on sepárate tracks. This traditional interface makes it easy 
for us humans to view MIDI data as tracks on a digital audio workstation (DAW) 
or analog tape recorder that follow along a straightforward linear time line. 

These tracks contain MIDI-related performance and control events that are 
made up of such channel and system messages as Note-On, Note-Off, Velocity, 
Modulation, Aftertouch and Program/Continuous Controller messages. Once a 
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performance has been recorded into a sequencer s memory, these events can be 
graphically arranged and edited into a musical performance. The data can then 
be saved as a file or within a DAW session and recalled at any time, allowing 
the data to be played back in its originally recorded or edited order. 

Because most sequencers are designed to function in a way that's similar to the 
multitrack tape recorder, they give us a familiar operating environment in which 
each instrument, set of layered instruments or controller data can be recorded 
onto sepárate, synchronously arranged tracks. Like its multitrack cousin, each 
track can be re-recorded, erased, copied and varied in level during playback. 
However, since the recorded data is digital in nature, a MIDI sequencer is far 
more flexible in its editing speed and control in that it offers all the cut-and- 
paste, signal processing and channel-routing features that we've come to expect 
from a digital production application. 

Integrated workstation sequencers 

A type of keyboard synth and sampler system known as a keyboard workstation 
will often inelude much of the necessary production hardware that's required 
for music production, including effeets and an integrated hardware sequencer. 
These Systems have the advantage of letting you take your instrument and 
sequencer on the road without having to drag your whole system along. Similar 
to the stand-alone hardware sequencer, a number of these sequencer systems 
have the disadvantage of offering few editing tools beyond transport functions, 
punch-in/out commands, and other basic edit functions. With the advent of 
more powerful keyboard systems that inelude a larger, integrated LCD display, 
the integrated sequencers within these systems are becoming more powerful, 
resembling their software sequencing counterparts. In addition, other types of 
palm-sized sequencers offer such features as polyphonic synth voices, drum 
machine kits, effeets, MIDI sequencing and, in certain cases, facilities for record- 
ing multitrack digital audio in an all-in-one package that literally fits in the 
palm of your hand! 

Software sequencers 

By far, the most common sequencer type is the software sequencing program 
(Figure 9.44). These programs run on all types of personal and laptop comput- 
ers and take advantage of the hardware and software versatility that only a 
Computer can offer in the way of speed, hardware flexibility, memory manage- 
ment, signal routing and digital signal processing. Software sequencers 
offer several advantages over their hardware counterparts. Here are just a few 
highlights: 

■ Increased graphics capabilities (giving us direct control over track and 
transport-related record, playback, mix and processing functions) 

■ Standard Computer cut-and-paste edit capabilities 

■ Ability to easily change note and controller valúes, one note at a time or 
over a defined range 
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FIGURE 9.44 

MIDI edit window 
within Cubase SX 
audio production 
software. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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■ A window-based graphic environment (allowing easy manipulation of 
program and edit-related data) 

■ Easy adjustment of performance timing and tempo changes within a 
session 

■ Powerful MIDI routing to múltiple ports within a connected system 

■ Graphic assignment of instrument voices via Program Change messages 

■ Ability to save and recall files using standard Computer memory media. 

Basic introduction to sequencing 

When dealing with any type of sequencer, one of the most important concepts 
to grasp is that these devices don't store sound directly; instead, they encode 
MIDI messages that instruct instruments as to what note is to be played, over 
what channel, at what velocity and at what, if any, optional controller valúes. 
In other words, a sequencer simply stores command instructions that follow in 
a sequential order. These instructions tell instruments and/or devices how their 
voices are to be played or controlled. This means that the amount of encoded 
data is a great deal less memory intensive than its digital audio or digital video 
recording counterparts. Because of this, the data overhead that's required by 
MIDI is very small, allowing a computer-based sequencer to work simultane- 
ously with the playback of digital audio tracks, video images, Internet browsing, 
etc., all without unduly slowing down the computer's CPU. For this reason, 
MIDI and the MIDI sequencer provide a media environment that plays well 
with other computer-based production media. 

RECORDING 

Commonly, a MIDI sequencer is an application within a digital production 
workspace for creating personal compositions in environments that range from 
the bedroom to more elabórate professional and project studios. Whether hard- 
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ware or software based, most sequencers use a working interface that's roughly 
designed to emulate a traditional multitrack-based environment. A tape-like set 
of transport Controls lets us move from one location to the next using standard 
Play, Stop, Fast Forward, Rewind and Record command buttons. Beyond using 
the traditional Record-Enable button to select the track or tracks that we want 
to record onto, all we need to do is select the MIDI input (source) port, output 
(destination) port, MIDI channel, instrument patch and other setup require- 
ments. Then press the record button and begin laying down the track. 


D I Y 

do it yourself 


Tutorial: Setting Up a Session 

1. Pulí out a favorite MIDI Instrument or cali up a 
favorite plug-in instrument. 

2. Route the instrument’s MIDI and audio cables to their 
appropriate workstation and audio mixer devices 
according to the situation at hand. 

3. Cali up a MIDI sequencer (either by beginning 
a DAW session or by using a hardware 
sequencer) and create a MIDI track that can be 
recorded to. 

4. Set the session to a tempo that feels right for the 
song. 

5. Assign the track’s MIDI input to the port that’s 
receiving the incoming MIDI data. 


and Laying Down a MIDI Track 

6. Assign the track’s MIDI output to the port and proper 
MIDI channel that’ll be receiving the outgoing MIDI 
data during playback. 

7. If a click track is desired, turn it on. 

8. Ñame the track. This will make it easier to identify the 
MIDI passage in the future. 

9. Place the track into the Record Ready mode. 

10. Play the instrument or controller. Can you hear the 
instrument? Do the MIDI activity indicators light up 
on the sequencer, track and MIDI interface? If not, 
check your cables and run through the checklist 
again. If so, press Record and start laying down your 
first track. 


Once you've finished laying down a track, you can jump back to the beginning 
of the recorded passage and listen to it. From this point, you could then "arm" 
(a term used to denote placing a track into the record-ready mode) the next 
track and go about the process of laying down additional tracks until a song 
begins to form. 

Setting the session tempo 

When beginning a MIDI session, one of the first 
aspects to consider is the tempo and time sig- 
nature. The beats-per-minute (bpm) valué will 
set the general tempo speed for the overall 
session. This is important to set at the beginning 
of the session, so as to lock the overall "bars and 
beats" timing elements to this initial speed that's 


To avoid any number of 
unforeseen obstacles to a 
straightforiuard production, it’s often 
mise to set your session tempo before 
pressing any record buttons. 
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often essential in electronic music production. This tempo/click element 
can then be used to lock the timing elements of other Instruments and/or 
rhythm machines to the session (e.g., a drum machine plug-in can be pulled 
into the session that'll automatically be locked to the session's speed and 
timing). 

Changing tempo 

The tempo of a MIDI production can often be easily changed without worrying 
about changing the program's pitch or real-time control parameters. In short, 
once you know how to avoid potential conflicts and pitfalls, tempo variations 
can be made after the fact with relative ease. Basically, all you need to do is 
alter the tempo of a sequence (or part of a sequence) to best match the 
overall feel of the song. In addition, the tempo of a session can be dynamically 
changed over its duration by creating a tempo map that causes the speed to 
vary by defined amounts at specific points within a song. Care and preplan- 
ning should be exercised when a sequence is synced to another media form 
or devi ce. 

Clicktrack 

When musical timing is important (as is often the case in modern music and 
visual media production), a click track can be used as a tempo guide for keeping 
the performance as accurately on the beat as possible. A click track can be set 
to make a distinctive sound on the measure boundary or (for a more accurate 
timing guide) on the first beat boundary and on subsequent meter divisions 
(e.g., tock, tick, tick, tick, tock, tick, tick, tick, ...). Most sequencers can output 

a click track by either using a dedicated beep sound (often 
outputting from the device or main speakers) or by 
sending Note-On messages to a connected instru- 
ment in the MIDI chain. The latter lets you use 
any sound you want and often at definable 
velocity levels. For example, a kick could sound 
on the beat, while a snare sounds out the 
measure divisions. A strong reason for using a 
click track (at least initially) is that it serves as a 
rhythmic guide that can improve the timing accu- 
racy of a performance. 

The use of a click track is by no means a rule. In certain instances, it can 
lead to a performance that sounds stiff. For compositions that loosely flow 
and are legato in nature, a click track can stifle the passage's overall feel 
and flow. As an alternative, you could turn the metronome down, have it sound 
only on the measure boundary, and then listen through one headphone. 
As with most Creative decisions, the choice is up to you and your current 
circumstance. 


Care should be taken in setting the 
proper time signature at the session’s 
outset. Listening to a 4/4 click can be 
disconcerting when the song is being 
performed in 3/4 time. 
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Múltiple track recording 

Although only one MIDI track is commonly recorded at a time, most mid- and 
professional-level sequencers allow us to record múltiple tracks at one time. 
This feature makes it possible for a multi-trigger instrument or for several per- 
formers to record to a sequence in one, live pass. For example, such an arrange- 
ment would allow for each trigger pad of a MIDI drum controller to be recorded 
to its own track (with each track being assigned to a different MIDI channel on 
a single port). Alternatively, several instruments of an on-stage electronic band 
could be captured to a sequence during a live performance and then laid into 
a DAW session for the making of an álbum project. 

Punching in and out 

Almost all sequencers are capable of punching in and out of record while 
playing a sequence (Figure 9.45). This commonly used function lets you 
drop in and out of record on a selected track (or series of tracks) in real 
time, in a way that mimics the traditional multitrack overdubbing process. 
Although punch-in and punch-out points can often be manually performed 
on the fly, most sequencers can automatically perform a punch by graphically 
or numerically entering in the measure/beat points that mark the punch-in 
and punch-out location points. Once done, the sequence can be rolled back 
to a point a few measures before the punch-in point and the artist can play 
along while the sequencer automatically performs the necessary switching 
functions. 



FIGURE 9.45 

Steinberg Cubase/ 
Nuendo punch-in and 
punch-out Controls. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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Tutorial: Punching During a Take 


1. Set up a track and record a musical passage. Save 
the session to disk. 

2. Roll back to the beginning of the take and play along. 
Manually punch ¡n and out during a few bars. Was that 
easy? Difficult? Now, undo or revert back to your 
originally saved session. 

3. Read your DAW/sequencer manual and leam how 
to perform an automated punch (placing your 


punch-in and punch-out points at the same place as 
before). 

4. Roll back to a point a few bars before the punch and 
go into the record mode. Did the track automatically 
place itself into record? Was that easier than doing it 
manually? 

5. Feel free to try other features, such as record looping 
or stacking. 


Step time entry 

In addition to laying down a performance track in real time, most sequencers 
will allow us to enter note valúes into a sequence one note at a time. This feature 
(known as step time, step input or pattern sequencing) makes it possible for 
notes to be entered into a sequence without having to worry about the exact 
timing. lipón playback, the sequenced pattern will play back at the session's 
original tempo. Fact is, step entry can be an amazing tool, allowing a difficult 
or a blazingly fast passage to be meticulously entered into a pattern and then 
be played out or looped with a degree of technical accuracy that would other- 
wise be impossible for most of us to play. Quite often, this data entry style is 
used with fast, high-tech musical styles where real-time entry just isn't possible 
or accurate enough for the song. 
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Tutorial: Is It Live or Is It MIDI? 


For those who want to dive into a whole new world of 
experimentation and sonic surprises, here’s a new adage: 
“If the ¡nstrument supports MIDI, record the performance 
data to a DAW track.” For example: 

■ You might record the MIDI out of a keyboard perfor¬ 
mance to a DAW MIDI track. If there’s a problem in the 
performance you can simply change the note and 
re-record it—or, if you want to change the sound, 
simply pick another sound. 


■ Record the sequenced track from a triggered drum set 
or controller to a DAW MIDI track. If you do a surround 
mix at a later time, simply re-record each drum part to 
new tracks and pan the drums around the soundfield. 

■ If a MIDI guitar riff needs some tweaking to fill out the 
sound, double the MIDI track an octave down. 

■ Acoustic drum recordings can benefit from MIDI by 
using a trigger device that can accept audio from a mic 
or recorded track and output the triggered MIDI 
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messages to a sampler or instrument, ¡n order to 
replace the bad tracks with samples that rock the 
house. By the way, don’t forget to record the trigger 
outputs to a MIDI track on a DAW or sequencer, just ¡n 
case you want to edit or change the sounds at a later 


time. Alternatively, you could even use replacement 
trlggering software to replace or augment poorly 
recorded acoustic tracks with killer sounds. 

It’s all up to you ... as you might imagine, surprises can 
definitely come from experiments like these. 


SAVING YOUR MIDI FILES 

Just as it's crucial that we carefully and methodically back up our program and 
production media, it's important that we save our MIDI and session files while 
we're in production. This can be done in two ways: 

■ Update your files by periodically saving files over the course of a produc¬ 
tion. A number of programs can be set up to automatically perform this 
function at regular intervals. 

■ At important points throughout a production, you might choose to save 
your session files under new and unique ñames, thereby making it pos- 
sible to easily revert back to a specific point in the production. This can 
be an important recovery tool should the session take a wrong turn or for 
re-creating a specific effect and/or mix. 


When working with MIDI within a DAW session, it’s 
ALWAYS a good ¡dea to save the original MIDI tracks 
within the session. This makes it easy to go back and 


change a note, musical key or sounding voice or to make 
any other alterations you want. Not saving these files 
could lead to some major headaches or worse. 


MIDI files can also be converted to and saved as a standard MIDI file for use in 
exporting to and importing from another MIDI program or for distributing 
MIDI data for use on the Web, in cell phones, etc. These files can be saved in 
either of two formats: 

■ Type 0: Saves all of the MIDI data within a session as a single MIDI track. 
The original MIDI channel numbers will be retained. The imported data 
will simply exist on a single track. Note that if you save a multi-instrument 
session as a Type 0, you'll loose the ability to save the MIDI data to discrete 
tracks within the saved sequence. 

■ Type 1: Saves all of the MIDI data within a session onto sepárate MIDI 
tracks that can be easily imported into a sequencer in a multitrack fashion. 

EDITING 

One of the more important features that a sequencer (or MIDI track within a 
DAW) has to offer is its ability to edit sequenced tracks or blocks within a track. 
Of course, these editing functions and capabilities often vary from one sequencer 


Editing 


to another. The main track window of a sequencer or MIDI track on a DAW is 
used to display such track information as the existence of track data, track 
ñames, MIDI port assignments for each track, program change assignments, 
volume controller valúes and other transport commands. 

Depending on the sequencer, the existence of MIDI data on a particular track 
at a particular measure point (or over a range of measures) is indicated by the 
highlighting of a track range in a way that's extremely visible. For example, in 
Figure 9.46, you'll notice that the gray MIDI tracks contain graphical bar display 
information. This means that these measures contain MIDI messages, while the 
nonhighlighted areas don't. 

By navigating around the various data display and parameter boxes, it's possible 
to use cut-and-paste and/or edit techniques to vary note valúes and parameters 
for almost every facet of a section or musical composition. For example, let's 
say that we really screwed up a few notes when laying down an otherwise killer 
bass rifif. With MIDI, fixing the problem is totally a no-brainer. Simply highlight 
each fudged note and drag it to its proper note location—we can even change 
the beginning and endpoints in the process. In addition, tons of other param¬ 
eters can be changed, including velocity, modulation and pitch bend, note and 
song transposition, quantization and humanizing (factors that eliminate or 
introduce human timing errors that are generally present in a live performance), 


FIGURE 9.46 

ProTools showing the 
main edit screen. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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in addition to full control over program and continuous controller messages ... 
the list goes on. 

Practical editing techniques 

When it comes to learning the Ins, Outs and Thrus of basic sequencing, abso- 
lutely nothing can take the place of diving in and experimenting with your 
setup. Here, I'd like to quote Craig Anderton, who said: "Read the manual 
through once when you get the program (or device), then play with the software 
and get to know it before you need it. Afterwards, reread the manual to pick up 
the system's finer operating points." Wise words! 

In this section, we'll be covering some of the basic techniques that'll speed you 
on your way to sequencing your own music. Note that there are no rules to 
sequencing MIDI. As with all of music production (and the arts, for that matter), 
there are as many right ways to perform and sequence music as there are musi- 
cians. Just remember that there are no hard and steadfast rules to music pro¬ 
duction—but there are definitely guidelines, tools and tips that can speed and 
impro ve the process. 

TRANSPOSITION 

As was mentioned earlier, a sequencer app is capable of altering individual notes 
in a number of ways including pitch, start time, length and controller valúes. 
In addition, it's generally a simple matter for a defined range of notes in a 
passage to be altered as a single entity in ways that could alter the overall key, 
timing and controller processes. Changing the pitch of a note or the entire key 
of a song is extremely easy to do with a sequencer. Depending on the system, 
a song can be transposed up or down in pitch at the global level, thereby affect- 
ing the musical key of a song. Likewise, a segment can be shifted in pitch from 
the main edit, piano roll or notation edit Windows by simply highlighting the 
bars and tracks that are to be changed and then calling up the transpose func- 
tion from the program menú. 

QUANTIZATION 

By far, most common timing errors begin with the performer. Fortunately, "to 
err is human," and standard performance timing errors often give a piece a live 
and natural feel. However, for those times when timing goes beyond the bounds 
of nature, an important sequencing feature known as quantization can help 
correct these timing errors. Quantization allows timing inaccuracies to be 
adjusted to the nearest desired musical time división (such as a quarter, eighth, 
or sixteenth note). For example, when performing a passage where all involved 
notes must fall exactly on the quarter-note beat, it's often easy to make timing 
mistakes (even on a good day). Once the track has been recorded, the problem- 
atic passage can be highlighted and the sequencer can recalculate each notes 
start and stop times so they fall precisely on the boundary of the closest time 
división. 


Editing 


HUMANIZING 

The humanization process is used to randomly alter all of the notes in a selected 
segment according to such parameters as timing, velocity and note duration. 
The amount of randomization can often be limited to a user-specified valué or 
percentage range, and parameters and can be individually selected or fine-tuned 
for greater control. Beyond the obvious advantages of reintroducing human-like 
timing variations back into a track, this process can help add expression by 
randomizing the velocity valúes of a track or selected tracks. For example, 
humanizing the velocity valúes of a percussion track that has a limited dynamic 
range can help bring it to life. The same type of life and human swing can be 
effective on almost any type of instrument. 

SLIPPING INTIME 

Another timing variable that can be introduced into a sequence to help change 
the overall feel of a track is the slip time feature. Slip time is used to move a 
selected range of notes either forward or backward in time by a defined number 
of dock pulses. This has the obvious effect of changing the start times for these 
notes, relative to the other notes or timing elements in a sequence. 

Editing controller valúes 

Almost every sequencer package allows controller message valúes to be edited 
or changed, and they often provide a simple, graphic window whereby a line 
or freeform curve can be drawn that graphically represents the effect that rele- 
vant controller messages will have on an instrument or voice. By using a mouse 
or other input device, it becomes a simple matter to draw a continuous stream 
of controller valúes that correspondingly change such variables as velocity, 
modulation, pan, etc. To physically change parameters, all you need to do is 
twiddle a knob, move a fader or graphically draw the variables on-screen in a 
WYSIWYG ("what you see is what you get") fashion. 


Some of the more common controller valúes that can affect a sequence and/or MIDI mix valúes inelude (for the full 
listing of controller ID numbers, see Table 9.1 earller in this chapter): 


Control # 


Parameter 
Modulation Wheel 
Breath Controller 
Foot Controller 

Channel Volume (formerly Main Volume) 

Balance 

Pan 

Expression Controller 

Damper Pedal on/off (Sustain) <63 off, >64 on 


2 

4 

7 

8 

10 

11 

64 
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It almost goes without saying that a range of controller events can be altered 
on one or more tracks by allowing a range of MIDI events to be highlighted 
and then altered by entering in a parameter or processing function from an edit 
dialog box. This ability to define a range of events often comes in handy for 
making changes in pitch/key, velocity, main volume and modulation (to ñame 
a few). 


0 I Y 1 

do it yourself 

Tutorial: Changing 

1. Read your DAW/sequencer manual and leam ¡ts 
basic controller editing features. 

2. Open or create a MIDI track. 

3. Select a range of measures and change their Channel 
Volume settings (controller 7) over time. Does the 
output level change over the segment? 

4. Highlight the segment and reduce the overall Channel 
Volume levels by 25%. 

5. Take the segment’s varying controller settings and 
set them to an absolute valué of 95. Did that 
elimínate the level differences? 

6. Now, refer to your DAW/sequencer manual for how 
to scale MIDI controller events over time. 


Controller Valúes 

7. Again, rescale the Channel Volume settings so they 
vary widely over the course of the segment. 

8. Highlight the segment and scale the velocity valúes 
so they have a mínimum valué of 64 and a máximum 
of 96. Could you see and hear the changes? 

9. Again, highlight the segment and instruct the 
software to fade ¡t from ¡ts current valué to an ending 
valué of 0. Hopefully, you’ve just created a Channel 
Volume fade. Did you see the MIDI channel fader 
move? 

10. Undo and start the fade with an initial valué of 0 and 
a current valué of 100% ending. Did the segment 
fade in? 


PLAYBACK 

Once a sequence is composed and saved to disk, all of the sequence tracks can 
be transmitted through the various MIDI ports and channels to the instruments 
or devices to make music, create sound effects for film tracks or control device 
parameters in real time. Because MIDI data exists as encoded real-time control 
commands and not as audio, you can listen to the sequence and make changes 
at any time. You could change the patch voices, alter the final mix or change 
and experiment with such controllers as pitch bend or modulation—even 
change the tempo and key signature. In short, this médium is infinitely flexible 
in the number of versions that can be created, saved, folded, spindled and 
mutilated until you've arrived at the overall sound and feel you want. Once 
done, you'll have the option of using the data for live performance or mixing 
the tracks down to a final recorded media, either in the studio or at home. 

During the summer, in a wonderful small-town tavern in the city where I live, 
there's a frequent performer who'll wail the night away with his voice, trusty 
guitar and a backup band that consists of several electronic synth modules and 
a laptop PC/sequencer that's just chock-full of country-'n-western sequences. 


Transferring MIDI to Audio Tracks 


His set of songs for the night is loaded into a song playlist feature thaté pro- 
grammed into his sequencer. LJsing this playlist, he queues his sequences so 
that when he's finished one song, taken his bow, introduced the next song and 
complimented the lady in the red dress, all he needs to do is press the space 
bar and begin playing the next song. Such is the life of an on-the-road sequencer. 

In the studio, it's become more the rule than the exception that MIDI tracks 
will be recorded and played back in sync with a DAW, analog multitrack machine 
or digital multitrack. As you're aware, using a DAW-based sequencer automati- 
cally integrates the world of audio and video with the MIDI production environ- 
ment. Whenever more than one playback médium is involved in a production, 
a process known as synchronization is required to make sure that events in the 
MIDI, analog, digital and video media occur at the same point in time. Locking 
events together in time can be accomplished in various ways (depending on the 
equipment and media type used). 

TRANSFERRING MIDI TO AUDIO TRACKS 

When mixing down a session that contains MIDI tracks, many folks prefer not 
to mix the sequenced tracks in the MIDI domain. Instead, they'll often export 
it to an audio track within the project. Here are a few helpful hints that can 
make this process go more smoothly: 

■ Set the main volume and velocities to a "reasonably" high output level, 
much as you would with any recorded audio track. 

■ Solo the MIDI and instrumenté audio input track and take a listen, 
making sure to turn off any reverb or other effects that might be on that 
instrument track (you might need to pulí out your instrumenté manual 
to flip through these settings). If you really like the instrument effect, of 
course, go ahead and record it; however, you might consider recording the 
track both with and without effects. 

■ If any mix-related moves have been programmed into the sequence, you 
"might" want to strip out volume, pan and other controller messages 
before exporting the track. This can be done manually; however, most 
DAW/sequencer packages inelude features for stripping controller data 
from a MIDI track. 

■ If the instrument has an acoustic (such as a MIDI grand piano or MIDI 
guitar room/amp setup) or multichannel output component, you might 
consider recording the instrument to stereo or surround tracks. This will 
allow for greater flexibility in creating ambient "space" within the final mix. 


Important Note 

It’s always wise to save the original MIDI track or file files will often limit your future options or result in major 

within the session. This makes future changes in the production headaches down the road. 

composition infinitely easier. Failure to save your MIDI 
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MIXING A SEQUENCE 

Almost all DAW and sequencer types will let you mix a sequence in the MIDI 
domain using various controller message types. This is usually done by creating 
a software interface that incorporates these Controls into a virtual on-screen 
mixer environment that often integrates with the main DAW mix screen. Instead 
of directly mixing the audio signáis that make up a sequence, these Controls are 
able to directly access such track controllers as Main Volume (controller 7), Pan 
(controller 10), and Balance (controller 8), most often in an environment that 
completely integrates into the workstation's overall mix Controls. Since the mix- 
related data is simply MIDI controller messages, an entire mix can be easily 
stored within a sequence file. Therefore, even with the most basic sequencer, 
you'll be able to mix and remix your sequences with complete automation and 
total settings recall whenever a new sequence is opened. As is almost always the 
case with a DAW's audio and MIDI graphical user interface (GUI), the control¬ 
ler and mix interface will almost always have moving faders and variable Con¬ 
trols (Figure 9.47). 


MUSIC PRINTING PROGRAMS 

Over the past few decades, the field of transcribing musical scores and arrange- 
ments has been strongly affected by both the Computer and MIDI technology. 
This process has been greatly enhanced through the use of newer generations 
of Computer software that make it possible for music notation data to be entered 
into a Computer either manually (by placing the notes onto the screen via 
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FIGURE 9.47 

MIDI tracks can be 
added into the mix 
for real-time 
parameter control 
over hardware and/or 
software devices. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 














































































































































































Music Printing Programs 


FIGURE 9.48 

Finale 2009 music 
printing program. 
(Courtesy of 
MakeMusic, Inc., 
www.finalemusic. 
com.) 
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keyboard or mouse movements), by direct MIDI input or by sheet music scan- 
ning technology. Once entered, these notes can be edited in an on-screen envi- 
ronment that lets you change and configure a musical score or lead sheet using 
standard cut-and-paste editing techniques. In addition, most programs allow 
the score data to be played directly from the score by electronic instruments via 
MIDI. A final and important program feature is their ability to quickly print out 
hard copies of a score or lead sheets in a wide number of print formats and 
styles. 

A music printing program (also known as a music notation program) lets you 
enter musical data into a computerized score in a number of manual and auto- 
mated ways (often with varying degrees of complexity and ease). Programs of 
this type (Figure 9.48) offer a wide range of notation symbols and type styles 
that can be entered either from a Computer keyboard or mouse. In addition to 
entering a score manually, most music transcription programs will generally 
accept MIDI input, allowing a part to be played directly into a score. This can 
be done in real time (by playing a MIDI instrument/controller or finished 
sequence into the program) or in step time (by entering the notes of a score 
one note at a time from a MIDI controller) or by entering a standard MIDI file 
into the program (which uses a sequenced file as the notation source). 

In addition to dedicated music printing programs, most DAW or sequencer 
packages will often inelude a basic music notation application that allows the 
sequenced data within a track or defined región to be displayed and edited 
directly within the program (Figures 9.49), from which it can be printed in a 
limited score-like fashion. However, a number of high-level workstations offer 
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FIGURE 9.49 

Steinberg Cubase/ 
Nuendo score 
window. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


scoring features that allow sequenced track data to be notated and edited in a 
professional fashion into a fully printable music score. 

As you might expect, music printing programs will often vary widely in their 
capabilities, ease of use and offered features. These differences often center 
around the graphical user interface (GUI), methods for inputting and editing 
data, the number of instrumental parts that can be placed into a score, the 
overall selection of musical symbols, the number of musical staves (the lines 
that music notes are placed onto) that can be entered into a single page or 
overall score, the ability to enter text or lyrics into a score, etc. As with most 
programs that deal with artistic production, the range of choices and general 
functionality reflect the style and viewpoints of the manufacturen so care should 
be taken when choosing a professional music notation program, to see which 
one would be right for your personal working style. 
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CHAPTERÍO 


Multimedia and the Web 


It's no secret that modern-day computers have gotten faster, sleeker and sexier 
in their overall design. In addition to their ability to act as a multifunctional 
production workhorse, one of the crowning achievements of modern computers 
is the degree of media and networking integration that has worked its way into 
our collective consciousness in a way that has come to be universally known as 
the household buzzword multimedia. 

The combination of working and playing with multimedia has found its way 
into modern Computer culture through the use of various hardware and soft¬ 
ware Systems that work in a multitasking environment and combine to bring 
you a unified experience that seamlessly involves such media types as: 

■ Text 

■ Graphics 

■ Audio and music 

■ Computer animation 

■ Musical instrument digital interface (MIDI) 

■ Video. 

The obvious reason for creating and integrating these media types is the human 
desire to share and communicate one's experiences with others. This has been 
done for centuries in the form of books and, in relatively more recent decades, 
through movies and televisión. In the here and now, the amazingly powerful 
and versatile presence of the Web can be added to this Communications list. 
Nothing allows individuáis and corporate entities alike to reach millions so 
easily. Perhaps most importantly, the Web is a multimedia experience that each 
individual can manipúlate, learn from and even respond to in an interactive 
fashion. The Web has indeed unlocked the potential for experiencing multime¬ 
dia events and information in a way that makes each of us a participant—not 
just a passive spectator. To me, this is the true revolution occurring at the dawn 
of the 21st century! 

THE MULTIMEDIA ENVIRONMENT 

When you get right down to it, multimedia is nothing more than a unified 
programming and operating system (OS) environment that allows múltiple 



The Multimedia Environment 


forms of program data and contení media to simultaneously stream and be 
routed to the appropriate hardware ports for output, playback and/or processing 
(Figure 10.1). 

The two most important concepts behind this environment are: 

■ Multitasking 

■ The device driver. 

Basically, multitasking can be thought of as a modern-day form of illusion. Just 
as a magic trick can be quickly pulled off with sleight of hand or a film that 
switches frames 24 times each second can create the illusion of continuous 
movement, the multimedia environment deceives us into thinking that all of 
the sepárate program and media types are working at the same time. In reality, 
the Computer uses multitasking to quickly switch from one program to the next 
in a cyclic fashion. Similar to the film example, newer Computer systems have 
gotten so lightning fast at cycling between programs and applications that they 
give the illusion they're all running at the same time. 

Another concept central to multimedia is that of the device driver. Briefly, a 
driver acts as a device-specific software patch cord that routes media data from 
the source application to the appropriate hardware output device (also known 
as a port) and from a port back to the application's input (Figure 10.2). Thus, 


FIGURE 10.1 

Example of a 
multimedia program 
environment. 
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software application driver 



hardware l/O 


FIGURE 10.2 

Basic interaction 
between a software 
application and a 
hardware device via 
the device driver. 


whenever any form of media playback is requested, it'll be recognized as being 
a particular data type and will be routed to the appropriate device driver and 
finally sent out to the selected output port (or ports). 

DELIVERY MEDIA 

Although media data can be stored and/or transmitted over a wide range of 
media storage devices, the most commonly found delivery media at the time 
of this writing are the: 

■ CD 

■ DVD 

■ Web. 

The CD 

One of the most important developments in the mass marketing and distribu- 
tion of large amounts of media is the compact disc (CD), both in the form of 
the CD-Audio and the CD-ROM. As most are aware, the CD-Audio disc is 
capable of storing up to 74 minutes of audio at a rate of 44.1 kHz/16 bits. Its 
cióse optical cousin, the CD-ROM, is capable of storing up to 700 MB of graph- 
ics, video, digital audio, MIDI, text, and raw data. Consequently, these pre- 
manufactured and user-encoded media are able to store large amounts of music, 
text, video, graphics, etc., to such an interactive extent that this médium has 
become a driving forcé among all Communications media. Table 10.1 details 
the various CD standards that are currently in use. 

The DVD 

Similar to their optical cousins DVDs (which, after a great deal of industry 
deliberation, simply stands for "DVD") can contain any form of data, 
llnlike CDs, these dises are capable of storing a whopping 4.7 gigabytes 
(GB) within a single-sided disc and 8.5 GB on a double-layered disc. This capac- 
ity makes the DVD the perfect delivery médium for encoding video in the 
MPEG-2 encoding format, data-intensive games, DVD-Audio and numerous 
DVD-ROM titles. The increased demand for multimedia games, educational 
producís, etc., has spawned the computer-related industry of CD and DVD-ROM 
authoring. The term authoring refers to the Creative, design, and programming 









Delivery Media 


Table 10.1 

CD Format Standards 

Format 

Description 

Red Book 

Audio-only standard; also called CD-A (Compact Disc Audio) 

Yellow Book 

Data-only format; used to write/read CD-ROM data 

Green Book 

CD-I (Compact Disc Interactive) format; never gained mass popularity 

Orange Book 

CD-R (Compact Disc Recordable) format 

White Book 

VCD (Video Compact Disc) format for encoding CD-A audio and MPEG-1 or 
MPEG-2 video data; used for home video and karaoke 

Blue Book 

Enhanced Musió CD format (also known as CD Extra or CD+) can contain 
both CD-A and data 

ISO-9660 

A data file format that’s used for encoding and reading data from CDs of all 
types across platforms 

Joliet 

Extensión of the ISO-9660 format that allows for up to 64 characters in its file 
ñame (as opposed to the 8 file + 3 extensión characters allowed by MS-DOS) 

Romeo 

Extensión of the ISO-9660 format that allows for up to 128 characters in the 
file ñame 

Rock Ridge 

Unix-style extensión of the ISO-9660 format that allows for long file ñames 

CD-ROM/XA 

Allows for extended usage for the CD-ROM format—Mode-1 is strictly Yellow 
Book, while Mode-2 Form-1 ineludes error correction and Mode-2 Form-2 
doesn’t allow for error correction; often used for audio and video data 

CD-RFS 

Incremental packet writing system from Sony that allows data to be written 
and rewritten to a CD or CD-RW (in a way that appears to the user much like 
the writing/retrieval of data from a hard drive) 

CD-UDF 

UDF (Universal Disc Format) is an open incremental packet writing system 
that allows data to be written and rewritten to a CD or CD-RW (in a way that 
appears to the user much like the writing/retrieval of data from a hard drive) 
according to the ISO-13346 standard 

HDCD 

The High-Definition Compatible Digital system adds 6 dB of gain to a Red 

Book CD (when played back on an HDCD-compatible player) through the use 
of a special companion mastering technique 

Macintosh FIFS 

An Apple file system that supports up to 31 characters in a file ñame; 
ineludes a data fork and a resource fork that identify which application should 
be used to open the file 


aspeets of putting together a CD/DVD project. At its most basic level, a project 
can be authored, mastered and burned to disc from a single commercial author- 
ing program. Whenever the stakes are higher, trained professionals and expen- 
sive systems are often called in to assemble, master and produce the final disc 
for mass duplication and sales. Table 10.2 details the various DVD video/audio 
formats currently in use. 
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Table 10.2 DVD Video/Audio Formáis 

Format 

Sample 

Rate 

(kHz) 

Bit Rate 

Bit/s 

Ch 

Common 

Format 

Compression 

PCM 

48, 96 

16, 20, 

24 

Up to 6.144 Mbps 

1 to 8 

48 kHz, 16 bit 

None 

AC3 

48 

16, 20, 

24 

64 to 448 kbps 

1 to 6.1 

192 kbps, 
stereo 

AC3 and 

384 kbps, 

448 kbps 

DTS 

48, 96 

16, 20, 

24 

64 to 1536 kbps 

1 to 7.1 

377 or 

754 kbps for 
stereo and 
754.5 or 
1509.25 kbps 
for 5.1 

DTS coherent 
acoustics 

MPEG-2 

48 

16, 20 


32 to 912 kbps 

1 to 7.1 

Seldom used 

MPEG 

MPEG-1 

48 

16, 20 


384 kbps 

2 

Seldom used 

MPEG 

SDDS 

48 

16 


Up to 1289 kbps 

5.1, 7.1 

Seldom used 

ATRAC 


The Web 

One of the most powerful aspects of multimedia is the ability to communicate 
experiences either to another individual or to the masses. For this, you need 
some kind of network connection. The largest and most common network that 
can be found in the home, studio, office or classroom is a connection to the 
Internet. 

Here's the basic gist of how the Internet works: 

■ The Internet (Figure 10.3) can be thought of as a Communications network 
that allows your Computer (or connected network) to be connected to an 
Internet Service Provider (ISP) server (a specialized Computer or cluster of 
ISP computers designed to handle, pass and route data between large 
numbers of user connections). 

■ These ISPs are connected (through specialized high-speed connections) to 
an interconnected network of network access points (NAPs). This network 
essentially forms the connected infrastructure of the World Wide Web 
(WWW). 

■ In its most basic form the WWW can be thought of as a unified array of 
connected networks. 

Internet browsers transmit and receive information on the Web via a Uniform 
Resource Locator (URL) address. This address is then broken down into three 
parts: the protocol (e.g., http), the server ñame (e.g., www.modrec.com) and the 
requested page or file ñame (e.g., index.htm). The connected server is able to 
transíate the server ñame into a specific Internet Provider (IP) address. This 








Delivery Formats 


FIGURE 10.3 

The Internet works 
by communicating 
requests and data 
from a user’s 
Computer to a single 
server that’s 
connected to other 
servers around the 
world, which are 
likewise connected to 
other users’ 
computers. 



address is used to connect your Computer with the desired server, after which 
the requests to receive or send data are communicated and the information is 
passed to your desktop. 

E-mail works in a similar data transfer fashion, with the exception that an e-mail 
isn't sent to or requested from a specific server (which might be simultaneously 
connected to a large group of users); rather, it's communicated from one specific 
e-mail address (e.g., myname@myprovider.com) to a destination address (e.g., 
yourname@yourprovider.com) . 


DELIVERY FORMATS 

When creating content for the various media Systems, it's extremely important 
that you match the media format and bandwidth requirements to the content 
delivery system that's being used. In other words, it's always smart to maximize 
the efficiency of the message (media format and required bandwidth) to 
match (and not aliénate) your intended audience. The following section out- 
lines many standard and/or popular formats for delivering media to a target 
audience. 

Digital audio 

Digital audio is obviously a component that adds greatly to the multimedia 
experience. It can augment a presentation by adding a dramatic music 
soundtrack, help us to communicate through speech or give realism to a 
soundtrack by adding sound effects. Because of the large amounts of data 
required to pass video, graphics and audio from a CD-ROM, the Internet or 
other media, the bit- and sample-rate structure of an uncompressed audio file 
is usually limited, compared to that of a professional-quality soundfile. At the 
"lo-fi" range, the generally accepted soundfile standard for multimedia produc- 
tion is either 8-bit or 16-bit audio at a sample rate of 11.025 or 22.050 kHz. 
This standard had come about mostly because older CD drive and processor 
Systems generally couldn't pass the professional 44.1-kHz rate. With the intro- 
duction of faster processing systems and better hardware, these limitations have 
generally been lifted to inelude 44.1-kHz/16-bit audio and compressed data 
formats that offer CD quality and discrete surround-sound playback capabili- 
ties. In addition, there are obvious limitations to communicating uncompressed 
professional-rate soundfiles over the Internet or from a CD or DVD disc that's 
also streaming full-motion video. Fortunately, with improvements in codee 
(encode/decode) techniques, hardware speed and design, the overall sonic and 
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production quality of compressed audio data has greatly improved in audience 
acceptance. 

Uncompressed soundfile formats 

Although several formats exist for encoding and storing soundfile data, only a 
few have been universally adopted by the industry. These standardized formats 
make it easier for files to be exchanged between compatible media devices. 
Probably the most common file type is the Wave (or .wav) format. Developed 
for the Microsoft Windows format, this universal file type supports both mono 
and stereo files at a variety of uncompressed resolutions and sample rates. Wave 
files contain pulse-code modulation (PCM) coded audio that follows the 
Resource Information File Format (RIFF) spec, which allows extra user Informa¬ 
tion to be embedded and saved within the file itself. The newly adopted Broad- 
cast Wave format, which has been adopted by the NARAS Producers and 
Engineers wing (www.grammy.com/Recording_Academy/Producers_and_Engi- 
neers) as the preferred soundfile format for DAW production and music 
archiving, allows for time-code-related positioning information to be directly 
embedded within the soundfile's datastream. 

The most commonly used file production format using Mac computers is the 
Audio Interchange File (AIFF; .aif) format. Fike Wave files, AIFF files support 
mono or stereo, 8-bit or 16-bit audio at a wide range of sample rates—and like 
Broadcast Wave files, AIFF files can also contain embedded text strings. Table 
10.3 details the differences between uncompressed file sizes as they range from 
the 32-bit/192-kHz rates that are used to encode DVD-Audio sound all the way 
down to voice-quality 8-bit/10-kHz files. 

Compressed codee soundfile formats 

As was mentioned earlier, high-quality uncompressed soundfiles often present 
severe challenges to media delivery systems that are restricted in terms of band- 
width, download times or memory storage. Although the streaming of audio 
data from various media and high-bandwidth networks (including the Web) 
has improved over the years, memory storage space and other bandwidth limi- 
tations have led to the popular acceptance of audio-related data formats known 
as codees. These formats can encode audio in a manner that reduces data file 
size and bandwidth requirements and then decode the information upon play- 
back using a system known as perceptual coding. 

PERCEPTUAL CODING 

The central concept behind perceptual coding is the psychoacoustic principie that 
the human ear will not always be able to hear all of the information that's 
present in a recording. This is largely due to the fact that louder sounds will 
often mask sounds that are both lower in level and relatively cióse to another 
louder signal. These perceptual coding schemes take advantage of this masking 
effect by filtering out noises and sounds that can't be detected and removing 
them from the encoded audiostream. 
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Table 10.3 Audio Bit Rate and File Sizes 


Sample 

Rate 

Word 

Length 

No. of 
Channels 

Data Rate 
(kbps) 

MB/min 

MB/hour 

192 

24 

2 

1152 

69.12 

4147.2 

192 

24 

1 

576 

34.56 

2073.6 

96 

32 

2 

768 

46.08 

2764.8 

96 

32 

1 

384 

23.04 

1382-4 

96 

24 

2 

576 

34.56 

2073.6 

96 

24 

1 

288 

17.28 

1036.8 

48 

32 

2 

384 

23.04 

1382.4 

48 

32 

1 

192 

11.52 

691.2 

48 

24 

2 

288 

17.28 

1036.8 

48 

24 

1 

144 

8.64 

518.4 

48 

16 

2 

192 

11.52 

691.2 

48 

16 

1 

96 

5.76 

345.6 

44.1 

32 

2 

352 

21.12 

1267.2 

44.1 

32 

1 

176 

10.56 

633.6 

44.1 

24 

2 

264 

15.84 

950.4 

44.1 

24 

1 

132 

7.92 

475.2 

44.1 

16 

2 

176 

10.56 

633.6 

44.1 

16 

1 

88 

5.28 

316.8 

32 

16 

2 

128 

7.68 

460.8 

32 

16 

1 

64 

3.84 

230.4 

22 

16 

2 

88 

5.28 

316.8 

22 

16 

1 

44 

2.64 

158.4 

22 

8 

1 

22 

1.32 

79.2 

11 

16 

2 

44 

2.64 

158.4 

11 

16 

1 

22 

1.32 

79.2 

11 

8 

1 

11 

0.66 

39.6 
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The perceptual encoding process is said to be lossy or destructive, because once 
the filtered data has been eliminated it can't be replaced or introduced back 
into the file. For the purposes of audio quality, the amount of audio that's to 
be removed from the data can be selected by the user during the encoding 
process. Higher bandwidth compression rates will remove less data from a 
stream (resulting in a reduced amount of filtering and higher audio quality), 
while low bandwidth rates will greatly reduce the datastream (resulting in 
smaller file sizes, increased filtering, increased artifacts and lower audio quality). 
The amount of filtering that's to be applied to a file will depend on the intended 
audio quality and the delivery medium's bandwidth limitations. Due to the 
lossy character of these encoded files, it's always a good idea to keep a copy of 
the original, uncompressed soundfile as a data archive backup should changes 
in contení or future technologies occur (never underestimate Murphy's law of 
technology). 

The perceptual coding schemes in most common use are: 

■ MP3 

■ MP4 

■ WMA 

■ AAC 

■ RealAudio 

■ FLAC. 

Many of the listed codees are capable of encoding and decoding audio using a 
constant bit rate (CBR) and variable bit rate (VBR) structure: 

■ CBR encoding is designed to work effectively in a streaming scenario 
where the end user's bandwidth is a consideration. With CBR encoding, 
the chosen bit rate will remain constant over the course of the file or 
stream. 

■ VBR encoding is designed for use when you want to create a downloadable 
file that has a smaller file size and bit rate without sacrificing sound and 
video quality. This is carried out by detecting which sections will need the 
highest bandwidth and adjusting the encode process accordingly. When 
lower rates will suffice, the encoder adjusts the process to match the 
contení. LJnder optimum conditions, you might end up with a VBR- 
encoded file that has the same quality as a CBR-encoded file, but is only 
half the file size. 


MP3 

MPEG (which is pronounced "M-peg" and stands for the Moving Picture Experts 
Group; www.mpeg.org) is a standardized format for encoding digital audio and 
video into a compressed format for the storage and transmission of various 
media over the Web. As of this writing, the most popular format is the ISO- 
MPEG Audio Level-2 Layer-3, commonly referred to as MP3. Developed by the 
Fraunhofer Institute (www.iis.fhg.de) and Thomson Multimedia in Europe, 
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MP3 has advanced the public awareness and acceptance of compressing and 
distributing digital audio by creating a codee that can compress audio by a 
substantial factor while still maintaining quality levels that approach those of 
a CD (depending on which compression levels are used). 

Although faster Web connections are capable of streaming MP3 in real time, 
this format is most often downloaded to the end consumer for storage to disk, 
disc and memory media for the storage and playback of downloaded songs. 
Once saved, the data can then be transferred to solid-state playback devices 
(such as portable MP3 players, PDAs, cell phone players—you ñame it!). In fact, 
billions of music tracks are currently being downloaded every month on the 
Internet using MP3, practically every personal Computer contains licensed MP3 
software, virtually every song has been MP3 encoded and an astounding number 
of MP3 players are on the global market. This makes it the Web's most popular 
audio compression format by far. 

In 2001, MP3 Pro was introduced to the public as an encoding system for 
enhancing sound quality and improving the compression scheme. MP3 Pro 
works by splitting the encoding process into two parts. The first analyzes the 
low-frequency band information and encodes it into a normal MP3 stream 
(which allows for complete compatibility with existing MP3 players). The 
second analyzes the high-frequeney band information and encodes it in a way 
that helps preserve the high-frequeney content. When combined, the MP3 Pro 
codee creates a file that's more compact than original MP3 files, with equal or 
better sound quality and complete backward and forward compatibility. 
Although MP3 Pro professes to offer 128-kbps performance at a 64-kbps encod¬ 
ing rate (effectively doubling the digital music capacity of flash memory and 
compact dises), the general download community has been slow to adopt this 
new codee. 

MP4 

Like MP3, the MPEG-4 (MP4) codee is largely used for streaming data over the 
Web or for use with portable media devices. Largely based on Apple's Quick¬ 
Time "MOV" format, MP4 is used to encode both audio and video media data 
at various quality bit depths and with multichannel (surround) capabilities. 

WMA 

Developed by Microsoft as their corporate response to MP3, Windows Media 
Audio (WMA) allows for compression rates that can encode high-quality audio 
at low bit-rate and file-size settings. Designed for ripping (encoding audio from 
audio CDs) and soundfile encoding/playback from within the popular Win- 
dow's Media Player (Figure 10.4), this format has grown in general acceptance 
and popularity. In addition to its high quality at low bit rates, WMA has the 
advantage of allowing for real-time streaming over the Internet (as witnessed 
by the large amount of radio and Internet stations that stream to the Windows 
Media Player at various bit-rate qualities). Finally, content providers often favor 
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FIGURE 10.4 

Window’s Media 
Player in skin mode; 
notice that the “Bars 
and Waves” 
visualization theme 
can act as a basic 
spectral display. 


MWA over MP3, because it is able to provide for a degree of contení copy 
protection through its incorporation of Digital Rights Management (DRM) 
coding. With the introduction of Windows Media Player Versión 9 and later 
versions, WMA is able to encode and deliver audio in discrete surround sound 
(Figure 10.5). 

AAC 

Jointly developed by Dolby Labs, Sony, ATT, and the Fraunhofer Institute, the 
Advanced Audio Coding (AAC) scheme is touted as a multichannel-friendly 
format for secure digital music distribution over the Internet. Stated as having 
the ability to encode CD-quality audio at lower bit rates than other coding 
formáis, AAC not only is capable of encoding 1, 2 and 5.1 surround-soundfiles 
but can also encode up to 48 channels within a single bitstream at bit/sample 
rates of up to 24/96. This format is also SDMI (Secure Digital Music Initiative) 
compliant, allowing copyrighted material to be protected against unauthorized 
copying and distribution. 

RealAudio 

With the introduction of their RealPlayer server application, RealNetworks 
(www.realnetworks.com) became one of the first companies to provide real- 
time streaming over the Web. RealAudio™ data is transmitted using any of more 
than 12 proprietary encoding levels that range from transmission rates of 8 kbps 
(low-fidelity mono voice quality over a 56-kbps modem) to speeds that exceed 
the 1.5-Mbps point. Although there are several compression levels to choose 
from, the most common music mode type compresses data in a way that doesn't 
introduce extreme artifacts over a wide dynamic range, thereby allowing it to 
create an algorithm that can faithfully reproduce music with near-FM quality 
over 56-kbps or faster lines. At the originating Internet site, the RealAudio server 
can automatically recognize which modem, cable or network connection speed 
is currently in use and then transmits the data in the best possible audio format. 
This reduced data throughput ultimately means that RealPlayer will take up very 
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FIGURE 10.5 

Windows XP speaker 
setup box for stereo, 
surround and other 
playback schemes. 
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little ofyour computer's resources, allowingyou to keep on working while audio 
is being played. 

FLAC 

FLAC (Free Lossless Audio Codee) is a format that makes use of a data compres- 
sion scheme that's cap able of reducing an audio file's data size by 40% to 50% 
in a lossless fashion that maintains the sonic integrity of the original stereo and 
multichannel source audio. As the ñame suggests, FLAC is a free, open-source 
codee that can be used by software developers in a royalty-free fashion. 

Tagged metadata 

Within certain types of multimedia file formats it's possible to embed a wide 
range of data into the file header. This data can identify and provide extensive 
information that relates to the content of the file. For example, within the 
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FIGURE 10.6 

Embedded metadata 
file tags can be 
added to a media file 
within Windows 
Explorer. 


Windows OS file structure, right-clicking on a file ñame within the Windows 
Explorer menú and selecting Properties/Summary (or Control-clicking/Get 
Info on a Mac) will bring up a set of extensive metadata file tags that can 
be used to enter and identify such media information as artist, álbum title, 
year, track number, genre, lyrics, title, comments, etc. (Figure 10.6). Note that 
this metatag info can also be entered into many DAWs, editors and OS file 
managers and can often be transferred from one format type to another upon 
conversión. 

MIDI 

One of the unique advantages of MIDI as it applies to multimedia is the rich 
diversity of musical instruments and program styles that can be played back in 
real time, while requiring almost no overhead processing from the computer's 
CPU. This makes MIDI a perfect candidate for playing back soundtracks from 
multimedia games or over the Internet. It's interesting to note that MIDI has 
taken a back seat to digital audio as a serious music playback format for mul¬ 
timedia. 
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Most likely, this is due to several factors, including: 

■ A basic misunderstanding of the médium 

■ The fact that producing MIDI content requires a basic knowledge of music 

■ The frequent difficulty of synchronizing digital audio to MIDI in a multi¬ 
media environment 

■ The fact that soundcards often inelude poorly designed FM synthesizers 
(although most operating systems now inelude higher quality software 
synths). 

Fortunately, a number of companies have taken up the banner of embedding 
MIDI within their media projeets and have helped push MIDI a bit more into 
the Web mainstream. As a result, it's becoming more common for your PC to 
begin playing back a MIDI score on its own or perhaps in conjunction with a 
game or more data-intensive program. 

Standard MIDI files 

The accepted format for transmitting files or real-time MIDI information in 
multimedia (or between sequencers from different manufacturers) is the stan¬ 
dard MIDI file. This file type (which is stored with a .mid or .smf extensión) is 
used to distribute MIDI data, song, track, time signature and tempo information 
to the general masses. Standard MIDI files can support both single and multi- 
channel sequence data and can be loaded into, edited and then directly saved 
from almost any sequencer package. When exporting a standard MIDI file, keep 
in mind that they come in two basic flavors—type 0 and type 1: 

■ Type 0 is used whenever all of the tracks in a sequence need to be merged 
into a single MIDI track. All of the notes will have a channel number 
attached to them (i.e., will play various instruments within a sequence); 
however, the data will have no definitive track assignments. This type 
might be the best choice when creating a MIDI sequence for the Internet 
(where the sequencer or MIDI player application might not know or care 
about dealing with múltiple tracks). 

■ Type 1, on the other hand, will retain its original track information struc- 
ture and can be imported into another sequencer type with its basic track 
information and assignments left intact. 

General MIDI 

One of the most interesting aspeets of MIDI production is the absolute unique- 
ness of each professional and even semipro project studio. In fact, no two 
studios will be alike (unless they've been specifically designed to be the same 
or there's some unlikely coincidence). Each artist will be unique as to his or her 
own favorite equipment, supporting hardware, way of routing channels and 
tracks and assigning patches. The fact that each system setup is unique and 
personal has placed MIDI at odds with the need for system and setup compat- 
ibility in the world of multimedia. For example, after importing a MIDI file over 
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the Net that's been created in another studio, the song will most likely attempt 
to play with a totally irrelevant set of sound patches (it might sound interesting, 
but it won't sound anything like what was originally intended). If the MIDI file 
is loaded into another setup, the sequence will again sound completely differ- 
ent, with patches that are so irrelevant that the guitar track might sound like a 
bunch of machine-gun shots from the planet Gloop. 

To eliminate (or at least reduce) the basic differences that exist between systems, 
a standardized set of patch settings, known as General MIDI (GM), was created. 
In short, General MIDI assigns a specific instrument patch to each of the 
128 available program change numbers. Since all electronic instruments that 
conform to the GM format must use these patch assignments, placing GM 
program change commands at the header of each track will automatically 
instruct the sequence to play with its originally intended sounds and general 
song settings. In this way, no matter what sequencer and system setup is used 
to play the file back, as long as the receiving instrument conforms to the GM 
spec, the sequence will be heard using its intended instrumentation. 

Tables 10.4 and 10.5 detail the program numbers and patch ñames that conform 
to the GM format. These patches inelude sounds that imitate synthesizers, 
ethnic instruments or sound effeets that have been derived from early Roland 
synth patch maps. Although the GM spec States that a synth must respond to 
all 16 MIDI channels, the first 9 channels are reserved for instruments, while 
GM restriets the percussion track to MIDI channel 10. 


GRAPHICS 

Graphic imaging occurs on the Computer screen in the form of pixels. These are 
basically tiny dots that blend together to create color images in much the same 
way that dots are combined to give color and form to your favo rite comic strip. 
Just as word length affeets the overall amplitude range of a digital audio signal, 
the number of bits in a pixel's word will affect the range of colors that can be 
displayed in a graphic image. For example, a 4-bit word only has 16 possible 
combinations. Thus, a 4-bit word will allow your screen to have a total of 16 
possible colors; an 8-bit word will yield 256 colors; a 16-bit word will give you 
65,536 colors; and a 24-bit word will yield a whopping total of 16.7 million 
colors! These methods of displaying graphics onto a screen can be broken down 
into several categories: 

■ Ráster graphics: In ráster graphics, each image is displayed as a series of 
pixels. This image type is what is used utilized when a single graphic image 
is used (i.e., bitmap, JPEG, GIF orTIFF format). The sense of motion can 
come from ráster images only by successively stepping through a number 
of changing images every second (in the same way that standard video 
images create the sense of motion). 

■ Vector graphics: This process often creates a sense of motion by projecting 
a background ráster image and then overlaying one or more objeets that 
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Table 10.4 GM Nonpercussion Instrument Patch Map 

1. Acoustic Grand Piano 

44. Contrabass 

87. Lead 7 (fifths) 

2. Bright Acoustic Piano 

45. Tremolo Strings 

88. Lead 8 (bass + lead) 

3. Electric Grand Piano 

46. Pizzicato Strings 

89. Pad 1 (new age) 

4. Honky-tonk Piano 

47. Orchestral Harp 

90. Pad 2 (warm) 

5. Electric Piano 1 

48. Timpani 

91. Pad 3 (polysynth) 

6. Electric Piano 2 

49. String Ensemble 1 

92. Pad 4 (choir) 

7. Harpsichord 

50. String Ensemble 2 

93. Pad 5 (bowed) 

8. Clavi 

51. SynthStrings 1 

94. Pad 6 (metallic) 

9. Celesta 

52. SynthStrings 2 

95. Pad 7 (halo) 

10. Glockenspiel 

53. Choir Aahs 

96. Pad 8 (sweep) 

11. Music Box 

54. Voice Oohs 

97. FX 1 (rain) 

12. Vibraphone 

55. Synth Voice 

98. FX 2 (soundtrack) 

13. Marimba 

56. Orchestra Hit 

99. FX 3 (crystal) 

14. Xylophone 

57. Trumpet 

100. FX 4 (atmosphere) 

15. Tubular Bells 

58. Trombone 

101. FX 5 (brightness) 

16. Dulcimer 

59. Tuba 

102. FX 6 (goblins) 

17. Drawbar Organ 

60. Muted Trumpet 

103. FX 7 (echoes) 

18. Percussive Organ 

61. French Horn 

104. FX 8 (sci-fi) 

19. Rock Organ 

62. Brass Section 

105. Sitar 

20. Church Organ 

63. SynthBrass 1 

106. Banjo 

21. Reed Organ 

64. SynthBrass 2 

107. Shamisen 

22. Accordion 

65. Soprano Sax 

108. Koto 

23. Harmónica 

66. Alto Sax 

109. Kalimba 

24. Tango Accordion 

67. Tenor Sax 

110. Bag pipe 

25. Acoustic Guitar (nylon) 

68. Baritone Sax 

111. Fiddle 

26. Acoustic Guitar (steel) 

69. Oboe 

112. Shanai 

27. Electric Guitar (jazz) 

70. English Horn 

113. Tinkle Bell 

28. Electric Guitar (clean) 

71. Bassoon 

114. Agogo 

29. Electric Guitar (muted) 

72. Clarinet 

115. Steel Drums 

30. Overdriven Guitar 

73. Piccolo 

116. Woodblock 

31. Distortion Guitar 

74. Flute 

117. Taiko Drum 


Continued ... 
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Table 10.4 continued 

32. 

Guitar Harmonics 

75. Recorder 

118. 

Melodic Tom 

33. 

Acoustic Bass 

76. Pan Flute 

119. 

Synth Drum 

34. 

Electric Bass (finger) 

77. Blown Bottle 

120. 

Reverse Cymbal 

35. 

Electric Bass (pick) 

78. Shakuhachi 

121. 

Guitar Fret Noise 

36. 

Fretless Bass 

79. Whistle 

122. 

Breath Noise 

37. 

Slap Bass 1 

80. Ocarina 

123. 

Seashore 

38. 

Slap Bass 2 

81. Lead 1 (square) 

124. 

Bird Tweet 

39. 

Synth Bass 1 

82. Lead 2 (sawtooth) 

125. 

Telephone Ring 

40. 

Synth Bass 2 

83. Lead 3 (calliope) 

126. 

Helicopter 

41. 

Violin 

84. Lead 4 (chiff) 

127. 

Applause 

42. 

Viola 

85. Lead 5 (charang) 

128. 

Gunshot 

43. 

Cello 

86. Lead 6 (voice) 




Table 10.5 GM Percussion Instrument Patch Map (Channel 10) 

35. 

Acoustic Bass Drum 

51. Ride Cymbal 1 

67. 

High Agogo 

36. 

Bass Drum 1 

52. Chínese Cymbal 

68. 

Low Agogo 

37. 

Side Stick 

53. Ride Bell 

69. 

Cabasa 

38. 

Acoustic Snare 

54. Tambourine 

70. 

Maracas 

39. 

Hand Clap 

55. Splash Cymbal 

71. 

Short Whistle 

40. 

Electric Snare 

56. Cowbell 

72. 

Long Whistle 

41. 

Low Floor Tom 

57. Crash Cymbal 2 

73. 

Short Güiro 

42. 

Closed Hi-Hat 

58. Vibraslap 

74. 

Long Güiro 

43. 

High Floor Tom 

59. Ride Cymbal 2 

75. 

Claves 

44. 

Pedal Hi-Hat 

60. Hi Bongo 

76. 

Hi Wood Block 

45. 

Low Tom 

61. Low Bongo 

77. 

Low Wood Block 

46. 

Open Hi-Hat 

62. Mute Hi Conga 

78. 

Mute Cuica 

47. 

Low-Mid Tom 

63. Open Hi Conga 

79. 

Open Cuica 

48. 

Hi-Mid Tom 

64. Low Conga 

80. 

Mute Triangle 

49. 

Crash Cymbal 1 

65. High Timbale 

81. 

Open Triangle 

50. 

High Tom 

66. Low Timbale 




Note: In contrast to Table 10.4, the numbers in Table 10.5 represent the percussion keynote numbers 
on a MIDI keyboard, not program change numbers. 






























Desktop Video 


can be animated according to a series of programmable vectors. By instruct- 
ing each object to move from point A to point B to point C according to 
a defined script, a sense of animated motion can be created without the 
need to project sepárate images for each frame. This script form reduces 
a files data size dramatically and is used with several image animation 
programs (including Macromedia's Flash™, Shockwave™ and Director®). 

■ Wireframe animation: This form of animation uses a Computer to create a 
complex series of wireframe image vectors of a real or imaginary object. 
Once programmed, these stick-like objects can be filled in with any type 
of skin, color, shading, etc., and then programmed to move with a stag- 
gering degree of realism. Obviously, with the increased power of modern 
computers and supercomputers, this graphic art form has attained higher 
degrees of artistry or realism within modern-day film, video and desktop 
visual production. 


DESKTOP VIDEO 

With the proliferation of digital VCRs, video interface hardware and video 
editing software systems, desktop and laptop video has begun to play an increas- 
ingly important role in multimedia production and content. Video is encoded 
into a datastream as a continuous series of successive frames, which are refreshed 
at rates that vary from 12 or fewer frames/second (fr/sec) to the standard broad- 
cast rates of 29.97 and 30 fr/sec. As with graphic files, a single full-sized video 
frame can be made up of a gazillion pixels, which are themselves encoded as a 
digital word of n bits. Multiply these figures by nearly 30 frames and you'll come 
up with a rather impressive data file size. 

Obviously, it's more common to find such file sizes and data throughput rates 
on higher-end desktop systems and professional video editing workstations; 
however, several options are available to help bring video down to data rates 
that are suitable for multimedia and even the Internet: 

■ Windoiv size: The basics of making the viewable picture smaller are simple 
enough: Reducing the frame size will reduce the number of pixels in a 
video frame, thereby reducing the overall data requirements during play- 
back. 

■ Frame rate: Although standard video frame rates run at around 30 fr/sec 
(United States and Japan) and 25 fr/sec (Europe), these rates can be 
lowered to 12 fr/sec in order to reduce the encoded file size or throughput. 

■ Compression: In a manner similar to that which is used for audio, codees 
can be applied to a video frame to reduce the amount of data that's nec- 
essary to encode the file by filtering out and smoothing over pixel areas 
that consume data or by encoding data that doesn't change from frame 
to frame into a shorthand to reduce data. In situations where high levels 
of compression are needed, it's common to accept degradations in the 
video's resolution in order to reduce the file size and/or data throughput 
to levels that are acceptable to a restrictive médium (e.g., the Web). 
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From all of this, it's clear that there are many options for encoding a desktop 
video file. When dealing with video clips, tutorials and the like, it's common 
for the viewing window to be médium in size and encoded at a médium to 
lower frame rate. This middle ground is often chosen in order to accommodate 
the standard data throughput that can be streamed off of most CD-ROMs and 
the Web. These files are commonly encoded using Microsoft's Audio-Video 
Interleave (AVI) format, QuickTime (a common codee that was developed by 
Apple and can be played by either a Mac or PC) or MPEG 1, 2 or 4 (codees that 
vary from lower multimedia resolutions to higher ones that are used to encode 
DVD movies). Both the Microsoft Windows and Apple OS platforms inelude 
built-in or easily obtained applications that allow all or most of these file types 
to be played without additional hardware or software. 

MULTIMEDIA AND THE WEB IN THE “NEED 
FORSPEED” ERA 

The household phrase "surfin' the Web" has become synonymous with jumping 
onto the Net, browsing the sites and grabbin' onto all of those hot songs, 
videos, and graphics that might wash your way. Dude, with improved audio 
and video codees and faster data connections (Table 10.6), the ability to search 
on any subject, download files, and stream audio or radio stations (Table 10.7) 


Table 10.6 

Internet Connection Speeds 


Connection 

Speed (bps) 

Description 

56k dial-up 

56 kbps (usually less) 

Common modem connection 

ISDN 

128 kbps; older technology 


DSL 

384 kbps or higher high-bandwidth 
phone line technology 


Cable 

384 kbps and higher high-bandwidth 
cable technology 


TI 

1.5 Mbps 


T3 

45 Mbps 


OC-1 

52 Mbps 

Optical fiber 

OC-3 

155 Mbps 

Optical fiber 

OC-12 

622 Mbps 

Optical fiber 

OC-48 

2.5 Gbps 

Optical fiber 

Ethernet 

10 Mbps 

Local-area network (LAN), not 
an Internet connection 

Fast Ethernet 

100 Mbps 

LAN, not an Internet connection 
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Table 10.7 

Streaming Data File Sizes 



Data Rate 
(kbps) 

File Size 
(kB/min) 

File Size 
(MB/hr) 

Minutes on a 
650-MB CD 

Hours on a 
650-MB CD 

20 

150 

9 

4333 

72 

64 

480 

28.8 

1354 

23 

96 

720 

43.2 

903 

15 

128 

960 

57.6 

677 

11 

160 

1200 

72 

542 

9 

256 

1920 

115.2 

339 

6 

384 

2880 

172.8 

226 

4 


from any point in the world has definitely changed our perception of modern- 
day Communications. 

Thoughts on being (and getting heard) in cyberspace 

Most of us have grown up in this age of the supermarket, where everything is 
wholesaled, processed, packaged and distributed to a single clearinghouse that's 
gotten so big that older folks can only shop there with the aid of a motorized 
shopping cart. For more than six decades, the music industry has largely worked 
on a similar principie: Find artists who'll fit into an existing marketing formula 
(or, more rarely, create a new marketing image), produce and package them 
according to that formula and put tons of bucks behind them to get them heard 
and distributed. Not a bad thing in and of itself; however, for independent 
artists the struggle has been, and continúes to be, one of getting themselves 
heard, seen and noticed—without the aid of the well-oiled megamachine. With 
the creation of cyberspace, not only are established record industry forces able 
to work their way onto your desktop screen (and into your multimedia speak- 
ers), but independent artists also have a new médium for getting heard. Through 
the creation of dedicated websites, search engines, links from other sites, and 
independent music dot-coms, as well as through Creative gigging and marketing, 
new avenues have begun to open up for the Web-sawy independent artist. 

Uploading to stardom! 

If you build it, they will come! This overly simplistic concept definitely doesn't 
apply to the Web. With an ever-increasing number of dot-whatevers going 
Online every month, expecting people to come to your site just because it's 
there simply isn't realistic. Like anything that's worthwhile, it takes connections, 
persistence, a good product and good ol'-fashioned dumb luck to be seen as 
well as heard! If you're selling your music, T-shirts or whatever at gigs, on the 
streets and to family and friends, cyberspace can help increase sales by making 
it possible (and even easy) to get your band, music or clients onto several 
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independent music websites that offer up descriptions, downloadable samples, 
direct sales and a link that goes directly to your main website. Such a site could 
definitely help to get the word out to a potentially new public—and help clue 
your audience in to what you and your music are all about. 

Cyberproducts can be sold and shipped via the traditional mail or phone-in 
order channels; however, it's long been considered hip in the Web world to flash 
the silver, gold or platinum credit card to make your purchase. Because attaining 
your own credit card processing and authorization system can be costly, a 
number of cyber companies have sprung up that offer secure credit card autho¬ 
rization, billing and artist payment plans for an overall sales percentage fee 
(Figure 10.7). 

The preview and/or distribution format choice for releasing all or part of your 
music to the listening audience will ultimately depend on you and the format/ 
layout style that's been adopted by the hosting site. For example, you could do 
any or all of the following: 

■ Place short, low-fidelity segments or free songs onto a site that can be 
streamed or downloaded to entice the listener to buy. 

■ Provide free access to the entire project (at low or médium fidelity) while 
encouraging the listener to buy the CD. 

■ Place several high-fidelity cuts on your site for free as a teaser or as a gift 
to your fan base. 

■ Place the music on a secure site that's SDMI compliant, using the pay-per- 
download system. 
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¡Tunes® media player 
and music website. 
(Courtesy of Apple 
Computers, Inc., 
www.apple.com.) 
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■ Sell the completed CD package on the site. 

■ Create a fanzine to keep your fans up-to-date on goings-on, upcoming 
releases, diaries, etc. 

No matter what or how many cyber distribution methods you choose for getting 
your music out, always take the time to read through the contractual fine print. 
Although most are above board and offer to get your music out on a nonexclu- 
sive basis ... caveat emptor (let the buyer, and the contení provider, beware)! 
In your excitement to get your stuff out there, you might not realize that you 
are signing away the rights for free distribution of that particular project (or 
worse). This hints at the fací that you're dealing with the music business, and, 
as with any business, you should always tread carefully in the cyber jungle. 

Copyright protection: wanna get paid? 

It seems that, at least in recent years, many (but by no means all) of the prob- 
lems of piracy have been addressed. With the shutdown of illegal music down- 
load sites and peer-to-peer networks and the risen-from-ashes versión of 
pay-per-download sites (such as www.iTunes.com and the reborn www.napster. 
com), many of the major labels and larger independent artists are on the verge 
of seeing a light at the end of the online tunnel. Even so, the technology that 
allows music to be shared online is still subject to abuse that has led to a basic 
breakdown of the conventional music sales model. 

Secure digital music initiative 

With the vast number of software (and hardware) Systems that are able to rip 
CDs to MP3s (or any other format) and MP3s back to audio and CD-R, the 
powers that be in the recording industry have grown increasingly fearful of the 
rising prevalence of copyright infringement. Although many online music sites 
legally use these formáis to allow potential buyers to preview music before 
buying or to simply put unreleased cuts onto the Web as a freebie, a number 
of sites still exist that connect online users to databases of music that has been 
illegally ripped and posted. Obviously, neither the artists ñor the record com- 
panies are being compensated for this distribution of their music. 

As a result, the Recording Industry Association of America (RIAA), major record 
labels and industry organizations have helped to form the Secure Digital Music 
Initiative (SDMI; www.sdmi.org). SDMI is an independent forum that brings 
together the worldwide recording, consumer electronics and information tech¬ 
nology industries to develop open specifications for protecting digital music 
distribution. As a result of these efforts, the Digital Rights Management (DRM) 
system was developed as a secure and encrypted means of solving the issue of 
unauthorized copying. DRM functions by digitally locking the contení and 
limiting its distribution to only those who pay for the contení. In short, it acts 
as a digital watermark that identifies the copyright owner and provides an elec- 
tronic key that allows access to the music or information once the original copy 
has been legally purchased. 
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It should be noted that (due to the wild success of file sharing and the non- 
copy-protected nature of MP3s) SDMI had basically been shunned by the 
buying public at large ... both for its protective qualities and for various tech- 
nological obstacles that can keep even legal copies from being played. 

Internet radio 

Due to the increased bandwidth of many Internet connections and improve- 
ments in audio streaming technology, many of the world's radio stations have 
begun to broadcast on the Web. In addition to offering a worldwide platform 
for traditional radio listening audiences, a large number of corporate and inde- 
pendent Web radio stations have begun to spring up that can help to increase 
the fan and listener base of musicians and record labels. Go ahead, get on the 
Web and listen to your favo rite Mexican station en vivo ... catch the latest dance 
craze from London ... or chill to reggae rhythms streaming in from the islands, 
mon. 

The virtual E-dub 

In addition to providing another vehicle for getting an artist's music out to the 
public at large, the Internet is making it easier for artists to e-collaborate over 
cyberspace. One approach is to E-dub across the Web. Using this approach, you 
can create a rough session mix of all or a portion of a song and export it to a 
medium-resolution MP3 or preferred codee. This file (along with sheet music 
jpgs, descriptive docs, etc.) could be sent to a collaborative buddy (either across 
the Street or across the world) who could then load the file into his or her DAW. 
A track or set of tracks can then be overdubbed to the original mix in the tra¬ 
ditional fashion. The resulting file (or files) should then be encoded without 
compression or at a high resolution (at least 192 kbps) and then e-mailed back 
to the artist or producer for re-import back into the original session at the proper 
time and position. (If your DAW doesn't automatically convert the file back into 
the session's native file format, you'll need to manually convert the file yourself.) 
Using this system, the world could be your cost-effective E-dub oyster. 


ON A FINAL NOTE 

One of the most amazing things about multimedia, cyberspace and their related 
technologies is the fact that they're ever changing. By the time you read this 
book, many new developments will have occurred. Oíd concepts will have 
faded away ... and new and possibly better ones will take over and then begin 
to take on a new life of their own. Although I've always had a fascination with 
crystal balls and have often had a decent sense about new trends in technology, 
there's simply no way to foretell the many amazing things that lie ahead in the 
fields of music, music technology, multimedia and especially cyberspace. As 
with everything techno, I encourage you to read the trades and surf the Web to 
keep abreast of the latest and greatest tools that have recently arrived or are 
about to rise on the horizon. 


This page intentionally left blank 



CHAPTER11 


Synchronization 


Over the years, electronic music has evolved into an indispensable production 
tool within almost all forms of media production. In video postproduction, 
for example, audio and video transports, digital audio workstations (DAWs), 
automated consolé Systems and electronic musical instruments routinely 
work together to help create a soundtrack and refine it into its finished form 
(Figure 11.1). The technology that allows múltiple audio and visual media to 
opérate in tándem so as to maintain a direct time relationship is known as 
synchronization or sync. 

Strictly speaking, synchronization occurs when two or more related events 
happen at precisely the same time. With respect to analog audio and video 
Systems, sync is achieved by interlocking the transport speeds of two or more 
machines. For computer-related Systems (such as digital audio, MIDI and digital 
video), synchronization between devices is often achieved through the use of a 
timing dock that can be fed through a sepárate line or can be directly embed- 
ded within the digital data line itself. Frequently, it's necessary for analog and 
digital devices to be synchronized together; and as a result, a number of inge- 
nious forms of Systems communication and data translation have been devel- 
oped. In this chapter, we'll explore the various forms of synchronization used 
for both analog and digital devices, as well as current methods for maintaining 
sync between media types. 

SYNCHRONIZATION BETWEEN MEDIA TRANSPORTS 

Maintaining relative sync between media devices doesn't require that all trans¬ 
port speeds involved in the process be constant; however, it's important that 
they maintain the same relative speed and position over the course of a program. 
Physical analog devices, for example, have a particularly difficult time achieving 
this. Due to differences in mechanical design, voltage fluctuations and tape 
slippage, it's a simple fact of life that analog tape devices aren't able to maintain 
a constant playback speed, even over relatively short durations. For this reason, 
accurate sync between analog and digital machines would be nearly impossible 
to achieve over any reasonable program length without some form of timing 
lock. It therefore quickly becomes clear that if production is to utilize múltiple 
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FIGURE 11.1 

Example of an 
integrated audio 
production system. 



MIDI instruments 


DAW/sequencer 


forms of media and record/playback systems, a method for maintaining sync is 
essential. 

Timecode 

The standard method of interlocking audio, video and film transports makes 
use of a code that was developed by the Society of Motion Picture and Televi¬ 
sión Engineers (SMPTE; www.smpte.org). This timecode (or SMPTE timecode) 
identifies an exact position on a tape or media form by assigning a digital 
address that increments over the course of a program's duration. This address 
code can't slip and always retains its original location, allowing for the con- 
tinuous monitoring of tape position to an accuracy of between l/24th and 
1/3Oth of a second (depending on the media type and frame rates being used). 
These divisional segments are called frames, a term taken from film production. 
Each audio or video frame is tagged with a unique identifying number, 
known as a timecode address. This eight-digit address is displayed in the 
form 00:00:00:00, whereby the successive pairs of digits represent 
hours:minutes:seconds:frames—HH:MM:SS:FF (Figure 11.2). 

The recorded timecode address is then used to lócate a position on magnetic 
tape (or any other recorded media) in much the same way that a letter carrier 
uses a written address to match up, lócate and deliver a letter to a specific, 
physical residence (i.e., by matching the address, you can then find the desired 
physical location point, as shown in Figure 11.3a). Fet's suppose that a time- 
encoded multitrack tape begins at time 00:01:00:00, ends at 00:28:19:00 and 
contains a specific cue point (such as a glass shattering) at 00:12:53:18 (Figure 
11.3b). By monitoring the timecode readout, it's a simple matter to lócate the 
precise position that corresponds to the cue point on the tape and then perform 
whatever function is necessary, such as inserting an effect into the sound track 
at that specific point ... CRASH! 
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00:01:34:07 


FIGURE 11.2 

Readout of a SMPTE 
timecode address in 
HH:MM:SS:FF. 



ress 

^^YourTown 
Your Country 




FIGURE 11.3 

Location of relative 
addresses: (a) postal 
address analogy; (b) 
timecode addresses 
and a cue point on 
longitudinal tape. 


It should be noted that the standard method for encoding timecode within 
analog audio production is to record (stripe) SMPTE timecode onto the highest 
available track (e.g., track 24). This track can then be read directly from the track 
in either direction and at a wide range of transport tape speeds. As we'll see 
later, digital audio devices often deal with timecode and its distribution in 
various ways. 

TIMECODE WORD 

The total of all time-encoded information that's encoded into each audio or 
video sync frame is known as a timecode word. Each word is divided into 80 
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FIGURE 11.4 

Biphase modulation 
encoding. 
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Data 
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equal segments, which are numbered consecutively from 0 to 79. One word 
covers an entire audio or video frame, such that for every frame there is a unique 
and corresponding timecode address. Address information is contained in the 
digital word as a series of bits that are made up of binary l's and O's, which (in 
the case of an analog SMPTE signal) are electronically encoded in the form of 
a modulated square wave. 

This method of encoding information is known as biphase modulation. Using 
this code type, a voltage transition in the middle of a half-cycle of a square wave 
represents a bit valué of 1, while no transition within this same period signifies 
a bit valué of 0 (Figure 11.4). The most important feature about this system is 
that detection relies on shifts within the pulse and not on the pulses polarity. 
Consequently, timecode can be read in either the forward or reverse direction, 
as well as at fast or slow shuttle speeds. 

D I Y 

do it yourself 

Tutorial: SMPTE Timecode 


1. Go to the “Tutorial section” of www.modrec.com, click 
on “SMPTE Audio Example,” and play the timecode 
soundfile. Not my favorite tune, but it’s a useful one! 

2. The 80-bit timecode word is subdivided into groups of 
4 bits (Figure 11.5), whereby each grouping represents 
a specific coded piece of information. Each 4-bit 


segment represents a binary-coded decimal (BCD) 
number that ranges from O to 9. When the full frame is 
scanned, all eight of these 4-bit groupings are read 
out as a single SMPTE frame number (in hours, 
minutes, seconds and trames). 


5YNC INFORMATION DATA 

Another form of information that's encoded into the timecode word is sync 
data. This information exists as 16 bits at the end of the timecode word. These 
bits are used to define the end of each frame. Because timecode can be read in 
either direction, sync data is also used to tell the device which direction the tape 
or digital device is moving. 
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TIMECODE FRAME STANDARDS 

In productions using timecode, it's important that the readout display be directly 
related to the actual elapsed time of a program, particularly when dealing with 
the exacting time requirements of broadcasting. For this reason, timecode frame 
rates may vary from one médium, production house or country of origin to 
another. 

■ 30 fr/sec (monochrome U.S. video): In the case of a black-and-white (mono- 
chrome) video signal, a rate of exactly 30 frames per second (fr/sec) is 
used. If this rate (often referred to as non-drop code) is used on a black- 
and-white program, the timecode display, program length and actual 
clock-on-the-wall time would all be in agreement. 

■ 29.97 fr/sec (drop-frame timecode for color NTSC video): The simplicity 
of 30 fr/sec was eliminated, however, when the National Televisión 
Standards Committee (NTSC) set the frame rate for the color video 
signal in the United States and Japan at 29.97 fr/sec. Thus, if a timecode 
reader that's set up to read the monochrome rate of 30 fr/sec were used 
to read a color program, the timecode readout would pick up an extra 
0.03 frame for every second that passes. Over the duration of an hour, the 
timecode readout would differ from the actual elapsed time by a total of 
108 frames (or 3.6 seconds). To correct this difference and bring the time¬ 
code readout and the actual elapsed time back into agreement, a series of 
frame adjustments was introduced into the code. Because the goal is to 
drop 108 frames over the course of an hour, the code used for color has 
come to be known as drop-frame code. In this system, two frame counts 
for every minute of operation are omitted from the code, with the excep- 
tion of minutes 00, 10, 20, 30, 40 and 50. This has the effect of adjusting 
the frame count so that it agrees with the actual elapsed duration of a 
program. 

■ 29.97 fr/sec (non-drop-frame code): In addition to the color 29.97 drop- 
frame code, a 29.97 non-drop-frame color standard can also be found in 
video production. When using non-drop timecode, the frame count will 
always advance one count per frame, without any drops. As you might 
expect, this mode will result in a disagreement between the frame count 
and the actual clock-on-the-wall time over the course of the program. 
Non-drop, however, has the distinct advantage of easing the time calcula- 
tions that are often required in the video editing process (because no 
frame compensations need to be taken into account). 

■ 25 fr/sec EBU (standard rate for PAL video): Another frame rate format that's 
used throughout Europe is the European Broadcast Union (EBU) time¬ 
code. EBU utilizes SMPTE's 80-bit code word but differs in that it uses a 
25 fr/sec frame rate. Because both monochrome and color video EBU 
signáis run at exactly 25 fr/sec, an EBU drop-frame code isn't necessary. 

■ 24 fr/sec (standard rate for film ivork): The médium of film differs from all 
of these in that it makes use of an SMPTE timecode format that runs at 
24 fr/sec. 
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From the above, it's easy to understand why confusión often exists as to 
which frame rate should be used on a project. Basically, if you are working 
on an in-house project that doesn't incorpórate time-encoded material that 
comes from the outside world, you should choose a rate that both makes 
sense for you and is likely to be compatible with an outside facility (should the 
need arise). 

For example, American electronic musicians who are working in-house will 
often choose to work at 30 fr/sec. Those in Europe have it easy, because on that 
continent 25 fr/sec is the logical choice for all music and video productions. 
On the other hand, those who work with projects that come through the door 
from other production houses will need to take special care to reference their 
timecode rates to those used by the originating media house. This can't be 
stressed enough: If care isn't taken to keep your timecode references at the 
proper rate, while keeping degradation to a minimum from one generation to 
the next, the various media might have trouble syncing up when it comes time 
to put the final master together—and that could spell BIG trouble. 

LTC AND VITC TIMECODE 

Currently, two major systems exist for encoding timecode onto magnetic tape: 

■ Longitudinal timecode (LTC) 

■ Vertical interval timecode (VITC). 

Timecode recorded onto an analog audio or video cue track is known as longi¬ 
tudinal timecode (LTC). LTC encodes a biphase timecode signal onto the analog 
audio or cue track in the form of a modulated square wave at a bit rate of 2400 
bits/sec. The recording of a perfect square wave onto a magnetic audio track is 
difficult, even under the best of conditions. For this reason, the SMPTE standard 
has set forth an allowable rise time of 25+5 microseconds for the recording and 
reproduction of valid code. This tolerance requires a signal bandwidth of 
15 kHz, which is well within the range of most professional audio recording 
devices. Variable-speed timecode readers are often able to decode timecode 
information at shuttle rates ranging from 1/1 Oth to 100 times normal playing 
speed. This is effective for most audio applications; however, in video postpro- 
duction it's often necessary to monitor videotape at slow or still speeds. 

Because LTC can't be read at speeds slower than 1/1 Oth to l/20th normal play 
speed, two methods can be used for reading timecode. The first of these uses a 
character generator to burn timecode addresses directly into the video image of 
a worktape copy. This superimposed readout allows the timecode to be easily 
identified, even at very slow or still picture shuttle speeds (Figure 11.6). In most 
situations, LTC code is preferred for audio, electronic music and midlevel video 
production, because it's a more accessible and cost-effective protocol. 

A second method—one that is used by major video production houses—is the 
vertical interval timecode (VITC). VITC makes use of the same SMPTE address 
and user code structure as LTC but is encoded onto videotape in an entirely 
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FIGURE 11.6 

Video image showing 
a burned-in timecode 
window. 



FIGURE 11.7 

Jam sync is used to 
restore distorted 
SMPTE when copying 
code from one 
machine to another. 
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different manner. VITC actually encodes the timecode information into the 
video signal itself, inside a field known as the vertical blanking interval. 

This field is located outside the visible picture sean area. Because the timecode 
data is encoded into the video signal itself, professional helical sean video 
recorders are able to read timecode at very slow and even still-frame speeds. 
Since timecode is encoded in the video signal, an additional track can be opened 
up on a video recorder for audio or cue information, while also eliminating the 
need for a burned-in window dub. 

TIMECODE REFRESHMENT/JAM SYNC 

Longitudinal timecode operates by recording a series of square-wave pulses onto 
magnetic tape. As you now know, it's somewhat difficult to record a square 
waveform onto analog magnetic tape without having the signal suffer modérate 
to severe waveform distortion (Figure 11.7). Although timecode readers are 
designed to be relatively tolerant of waveform amplitude fluctuations, such 
distortions are severely compounded when code is copied from one analog 
recorder to another by one or more generations. For this reason, a timecode 
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refresher has been incorporated into most timecode synchronizers and MIDI 
interface devices that have sync capabilities. Basically, this process reads the 
degraded timecode information from a previously recorded track and then 
amplifies and regenerates the square wave back into its original shape so it can 
be freshly recorded to a new track or read by another device. 

Should the quality of a SMPTE signal degrade to the point where the synchro- 
nizer can't differentiate between the pulses, the code will disappear and the 
slaves will stop unless the system ineludes a feature known as jam sync. Jam 
sync also refers to the synchronizers ability to output the next timecode valué, 
even though one has not appeared at its input. The generator is said to be 
working in a freewheeling fashion, since the generated code may not agree with 
the actual recorded address valúes; however, if the dropout occurs for only a 
short period, jam syncing works well to detect or refresh the signal. (This process 
is often useful when dealing with dropouts or undependable code from VHS 
audio tracks.) Two forms of jam sync options are available: 

■ Freewheeling 

■ Continuous. 

In the freewheeling mode, the receipt of timecode causes the generator's output 
to initialize when a valid address number is detected. The generator then begins 
to count in an ascending order on its own, ignoring any deterioration or dis- 
continuity in code and producing fresh, uninterrupted SMPTE address numbers. 
Continuous jam sync is used in cases where the original address numbers must 
remain intact and shouldn't be regenerated as a continuously ascending count. 
After the reader has been activated, the generator updates the address count for 
each frame in accordance with incoming address numbers and outputs an 
identical, regenerated copy. 

Synchronization using SMPTE timecode 

To achieve a frame-by-frame timecode lock between múltiple audio, video and 
film analog transports, it's necessary to use a device or integrated system that's 
known as a synchronizer (Figure 11.8). The basic function of a synchronizer is 
to control one or more tape, computer-based or film transports (designated as 
slave machines) so their speeds and relative positions are made to accurately 
follow one specific transport (designated as the master). Although the lines of 
distinction often break down, synchronization as a whole can be divided into 
two basic system types: those that are used in project or electronic music pro- 
duction facilities and those that can be found in larger audio and video produc- 
tion and postproduction facilities. The greatest reason for this división is not 
so much a system's performance as its price and the types of devices that are 
used in the process. 

The use of a synchronizer within a project studio environment often involves 
a multiport MIDI interface that ineludes provisions for locking an analog 
audio or video transport to a digital audio, MIDI or electronic music system by 
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FIGURE 11.8 

Example of timecode 
sync production 
using a simple 
MIDI interface 
synchronizer within 
a project studio. 
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translating LTC SMPTE code into MIDI timecode (more on this later in the 
chapter). In this way, one simple device can cost effectively serve múltiple pur- 
poses to achieve lock with a high degree of accuracy. Systems that are used in 
video production and in higher levels of production will often require a greater 
degree of control and remote-control functions throughout the studio or pro¬ 
duction facility. Such a setup will often require a more sophisticated device, 
such as a control synchronizer or an edit decisión list (EDL) controller. 

SMPTE OFFSET TIMES 

In the real world of audio production, programs or songs don't always begin at 
00:00:00:00. Let's say that you were handed a recording that needed a synth 
track to be laid down onto track 7 of a song that goes from 00:11:24:03 to 
00:16:09:21. Instead of inserting more than 11 minutes of empty bars into a 
MIDI track on your synched DAW, you could simply insert an offset start time 
of 00:11:24:03. This means that the sequenced track will begin to increment 
from measure 1 at 00:11:24:03 and will maintain relative offset sync throughout 
the program. 

Offset start times are also useful when synchronizing devices to an analog or 
videotape source that doesn't begin at 00:00:00:00. As you're probably aware, 
it takes a bit of time for an analog audio transport to settle down and begin 
playing (this wait time often quadruples whenever a videotape transport is 
involved). If a program's timecode were to begin at the head of the tape, it's 
extremely unlikely that you would want to start a program at 00:00:00:00, since 
playback would be delayed and extremely unstable at this time. Instead, most 
programming involving an analog audio or video media is striped with an 
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appropriate preroll of anywhere from 10 seconds to 2 minutes. Such a preroll 
gives all of the transports ampie time to begin playback and sync up to the 
master timecode source. 

In addition, it's often wise to start the actual production or first song at an offset 
time of 01:00:00:00 (some facilities begin at 00:01:00:00). This minimizes the 
possibility that the synchronizer will become confused by rolling over at mid- 
night; that is, if the contení starts at 00:00:00:00, the preroll would be in the 
23:59:00:00 range and the synchronizer would try to rewind to zero (rolling 
the tape off the reel) instead of rolling forward. Not always fun in the heat of 
a production! 

DISTRIBUTION OF 5MPTE SIGNALS 

In a basic audio production system, the only connection that's usually required 
between the master machine and a synchronizer is the LTC timecode track. 
Generally, when connecting analog slave devices, two connections will need to 
be made between each transport and the synchronizer. These inelude lines for 
the timecode reproduce track and the control interface (which often uses the 
Sony 9-pin remóte protocol for giving the synchronizer full logic transport 
and speed-related feedback information). LTC signal lines can be distributed 
throughout the production system in the same way that any other audio signal 
is distributed. They can be routed directly from machine to machine or patched 
through audio switching systems via balanced, shielded cables or unbalanced 
cables, or a combination of both. Because the timecode signal is biphase or 
symmetrical, it's immune to cable polarity problems. 

TIMECODE LEVELS 

One problem that can plague systems using timecode is crosstalk. This happens 
when a high-level signal leaks into adjacent signal paths or analog tape tracks. 
Currently, no industry standard levels exist for the recording of timecode onto 
magnetic tape or digital tape track; however, the levels shown in Table 11.1 can 
help you get a good signal level while keeping distortion and analog crosstalk 
to a minimum. 


Table 11.1 

Optimum Timecode Recording Levels 

Tape Format 

Track Format 

Optimum Recording Level 

ATR 

Edge track (highest number) 

-5 to -10 VU 

3/4-inch VTR 

Audio 1 (L) track or time 

-5 to 0 VU code 

1-inch VTR 

Cue track or audio 3 

-5 to -10 VU 

MDM 

Highest number track 

-20 dB 


Note: If the VTR is equipped with automatic gain compensation (AGC), override the AGC and adjust 
the signal gain Controls manually. 
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MIDI-based synchronization 

With the acceptance of MIDI and digital audio within media production an 
immediate solution for a new and cost-effective synchronization protocol was 
needed that could be easily used in project, mid- and large-scaled production 
environments. Over the years, devices such as DAWs, MIDI sequencers, digital 
mixing consoles and effects devices have become increasingly integrated and 
networked into the studio environment. These advances saw the rise of an easy- 
to-use and inexpensive standard that uses MIDI to transmit sync and timecode 
data throughout a connected production system (Figure 11.9). The following 
sections outline the various forms of synchronization that are often encountered 
in a MIDI-based production environment. Simply stated, most current forms 
of synchronization use the MIDI protocol itself for the transmission of sync 
messages. These messages are transmitted along with other MIDI data over 
standard MIDI cables, with no need for additional or special connections. 

MIDI REAL-TIME MESSAGES 

While MIDI isn't related to SMPTE timecode or any external reference, it's 
important to note that MIDI has a built-in (and often transparent) protocol for 
synchronizing all of the tempo and timing elements of each MIDI device in a 
system to a master dock. This protocol operates by transmitting real-time mes¬ 
sages to the various devices through standard MIDI cables, USB and internal 
CPU paths. Although these relationships are often automatically defined within 
a system setup, one MIDI device must be designated as the master device in 
order to provide the timing information to which all other slaved devices are 
locked. MIDI real-time messages consist of four basic types that are each 1 byte 
in length: 

■ Timing dock: A dock timing that's transmitted to all devices in the MIDI 
system at a rate of 24 pulses per quarter note (ppq). This method is used 
to improve the system's timing resolution and simplify timing when 
working in nonstandard meters (e.g., 3/8, 5/16, 5/32). 


FIGURE 11.9 

Many time-based 
media devices in the 
studio can be cost 
effectively connected 
via MIDI timecode 
(MTC). 
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■ Start: Upon receipt of a timing dock message, the start command instructs 
all connected devices to begin playing from the beginning of their internal 
sequences. Should a program be in midsequence, the start command 
repositions the sequence back to its beginning, at which point it begins 
to play. 

■ Stop: Upon the transmission of a MIDI stop command, all devices in the 
system stop at their current positions and wait for a message to follow. 

■ Continué: Following the receipt of a MIDI stop command, a MIDI continué 
message instructs all instruments and devices to resume playing from the 
precise point at which the sequence was stopped. Certain older MIDI 
devices (most notably drum machines) aren't capable of sending or 
responding to continué commands. In such a case, the user must either 
restart the sequence from its beginning or manually position the device 
to the correct measure. 

MIDI TIMECODE 

MIDI timecode (MTC) was developed to allow electronic musicians, project 
studios, video facilities and virtually all other production environments to cost 
effectively and easily transíate timecode into time-stamped messages that can 
be transmitted via MIDI. Created by Chris Meyer and Evan Brooks, MIDI time¬ 
code enables SMPTE-based timecode to be distributed throughout the MIDI 
chain to devices or instruments that are capable of synchronizing to and execut- 
ing MTC commands. MIDI timecode is an extensión of MIDI 1.0, which makes 
use of existing message types that were either previously undefined or were 
being used for other, nonconflicting purposes. 

Since most modern recording devices inelude MIDI in their design, there's often 
no need for external hardware when making direct connections. Simply chain 
the MIDI cables from the master to the appropriate slaves within the system 
(via physical cables, USB or virtual internal routing). Although MTC uses a 
reasonably small percentage of MIDI's available bandwidth (about 7.68% at 
30 fr/sec), it's customary (but not necessary) to sepárate these lines from those 
that are communicating performance data when using MIDI cables. As with 
conventional SMPTE, only one master can exist within an MTC system, while 
any number of slaves can be assigned to follow, lócate and chase to the master s 
speed and position. Because MTC is easy to use and is often included free in 
many system and program designs, this technology has grown to become the 
most straightforward and commonly used way to lock together such devices as 
DAWs, modular digital multitracks and MIDI sequencers, as well as analog and 
videotape machines (by using a MIDI interface that ineludes a SMPTE-to-MTC 
converter). 

MIDI TIMECODE CONTROL STRUCTURE 

The MIDI timecode format can be divided into two parís: 


■ Timecode 

■ MIDI cueing. 
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The timecode capabilities of MTC are relatively straightforward and allow 
devices to be synchronously locked or triggered to SMPTE timecode. MIDI 
cueing is a format that informs a MIDI device of an upcoming event that's to 
be performed at a specific time (such as load, play, stop, punch in/out, reset). 
This protocol envisions the use of intelligent MIDI devices that can prepare for 
a specific event in advance and then execute the command on cue. 

MIDI timecode is made up of three message types: 

■ Quarter-frame messages: These are transmitted only while the system is 
running in real or variable speed time, in either forward or reverse direc- 
tion. True to its ñame, four quarter-frame messages are generated for each 
timecode frame. Since 8 quarter-frame messages are required to encode a 
full SMPTE address (in hours, minutes, seconds and frames: 00:00:00:00), 
the complete SMPTE address time is updated once every two frames. In 
other words, at 30 fr/sec, 120 quarter-frame messages would be transmit¬ 
ted per second, while the full timecode address would be updated 15 
times in the same period. Each quarter-frame message contains 2 bytes. 
The first byte is Fl, the quarter-frame common header; the second byte 
contains a nibble (four hits) that represents the message number (0 
through 7) and a nibble for encoding the time field digit. 

■ Full messages: Quarter-frame messages are not sent in the fast-forward, 
rewind or lócate modes, because this would unnecessarily clog a MIDI 
data line. When the system is in any of these shuttle modes, a full message 
is used to encode a complete timecode address within a single message. 
After a fast shuttle mode is entered, the system generates a full message 
and then places itself in a pause mode until the time-encoded slaves have 
located to the correct position. Once playback has resumed, MTC will 
again begin sending quarter-frame messages. 

■ MIDI cueing messages: MIDI cueing messages are designed to address indi¬ 
vidual devices or programs within a system. These 13-bit messages can be 
used to compile a cue or edit decisión list, which in turn instructs one or 
more devices to play, punch in, load, stop, and so on, at a specific time. 
Each instruction within a cueing message contains a unique number, time, 
ñame, type and space for additional information. At the present time, only 
a small percentage of the possible 128 cueing event types has been defined. 


5MPTE/MTC CONVERSION 

An SMPTE-to-MIDI converter is used to read incoming SMPTE timecode and 
convert it into MIDI timecode (and vice versa). These conversión systems are 
available as a stand-alone device or as an integrated part of an audio interface 
or multiport MIDI interface/patch bay/synchronizer system (Figure 11.10). 
Certain analog and digital multitrack systems inelude a built-in MTC port 
within their design, meaning that the machine can be synchronized to a DAW/ 
sequencing system (with a MIDI interface) without the need for any additional 
hardware. 
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FIGURE 11.10 

SMPTE timecode can 
often be generated 
throughout a 
production system, 
possibly as either 
LTC or as MTC via 
a capable MIDI or 
audio interface. 
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FIGURE 11.11 

Example of a system 
whose overall timing 
elements are locked 
to a black burst 
reference signal. 


Video’s need for a stable timing reference 

Whenever a video signal is copied from one machine to another, it's essential 
that the scanned data (containing timing, video and user information) be 
copied in perfect sync from one frame to the next. Failure to do so will result 
in severe picture breakup or, at best, the vertical rolling of a black line over the 
visible picture area. Copying video from one machine to another generally isn't 
a problem, because the VCR or VTR that's doing the copying obtains its sync 
source from the playback machine. Video postproduction houses, however, 
often simultaneously use any number of video decks, switchers and edit control- 
lers during the production of a single program. Mixing and switching between 
these sources without a stable sync source could result in chaos ... with the end 
result being a very unhappy client. 

Fortunately, referencing all of the video, audio and timing elements to an 
extremely stable timing source (called a black burst or house sync generator) 
will generally resolve this sync nightmare. This reference dock serves to synchro- 
nize the video frames and timecode addresses that are received or transmitted 
by every video-related device in a production facility, so the leading frame edge 
of every video signal occurs at exactly the same instant in time (Figure 11.11). 
By resolving all video and audio devices to a single black burst reference, you're 
assured that relative frame transitions and speeds throughout the system will 
be consistent and stable. This even holds true for slaved analog machines, 
because their transport's wow and flutter can be smoothed out when locked to 
such a stable timing reference. 
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Digital audio’s need for a stable timing reference 

The process of maintaining a synchronous lock between digital audio 
devices or between digital and analog Systems differs fundamentally from 
the process of maintaining relative speed between analog transports. This is 
due to the fact that a digital system generally achieves synchronous lock by 
adjusting its playback sample rate (and thus its speed and pitch ratio) so as 
to precisely match the relative playback speed of the master transport. When- 
ever a digital system is synchronized to a time-encoded master, a stable timing 
source is extremely important in order to keep jitter (in this case, an increased 
distortion due to rapid pitch shifts) to a minimum. In other words, the 
source's program speed should vary as little as possible to prevent any degra- 
dation in the digital signal's quality. For example, all analog tape machines 
exhibit speed variations, which are caused by tape slippage and transport 
irregularities. 

If we were to synchronize a digital device to an analog master source that 
contains excessive wow and flutter, the digital system would be required to 
constantly speed up and slow down to precisely match the transport's speed 
fluctuations. One way to avoid such a problem would be to use a source that's 
more stable, such as a video deck or DAW. 


REAL-WORLD SYNC APPLICATIONS FOR USING 
TIMECODE AND MIDI TIMECODE 

Before we delve into the many possible ways that a system can be set up to work 
in a timecode environment, it needs to be understood that each system will 
often have its own particular personality and that the connections, software and 
operation of one system might differ from those of another. This is often due 
to factors such as system complexity and the basic hardware types that are 
involved, as well as the type of hardware and software systems that are installed 
in a DAW. Larger, more expensive setups that are used to create televisión and 
film soundtracks will often involve extensive timecode and system interconnec- 
tions that can easily get complicated. 

Fortunately, the use of MIDI timecode has greatly reduced the cost and complex¬ 
ity of connecting and controlling a synchronous electronic music and project 
studio down to levels that can be easily managed by both experienced and 
novice users. Having said these things, I'd still like to stress that solving syn- 
chronization problems will often require as much intuition, perseverance, 
insight and art as it will technical skill. For the remainder of this chapter, we'll 
be looking into some of the basic concepts and connections that can be used 
to get your system up and running. Beyond this, the next best course of action 
will be to consult your manuals, seek help from an experienced friend or cali 
the tech department about the particular hardware or software that's giving both 
you and your system the willies. 
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FIGURE 11.12 

There can be only 
one master in a 
synchronized system; 
however, there can 
be any number of 
slaves. 


Master/slave relationship 

Since synchronization is based on the timing relationship between two or more 
devices, it follows that the logical way to achieve sync is to have one or more 
devices (known as slaves) follow the relative movements of a single transport 
or device (known as the master). The basic rule to keep in mind is that there 
can be only one master in a connected system; however, any number of slaves 
can be set to follow the relative movements of a master transport or device 
(Figure 11.12). 

Generally, the rule for deciding which device will be the master in a production 
system can best be determined by asking a few questions: 

■ What type of media is the master timecode media recorded on? 

■ Which device will easily serve as the master? 

■ Which device will provide the most stable timing reference? 

If the master comes to you from an outside source, the answer to the first ques- 
tion will most likely solve your dilemma. If the project is in-house and you have 
total say in the matter, you might want to research your options more fully. The 
following sections can help give you insights into which devices will best serve 
as the master within a particular system. 

Audio recorders 

In many audio production situations, whenever an analog tape recorder is con¬ 
nected in a timecode environment, this machine will want to act as the master 
because costly hardware is often required to lock an analog machine to an 
external time source. This is due to the fact that the machine's speed regulator 
(generally a DC capstan servo) must be connected into a special feedback 
control loop in a way that allows it to continuously adjust its speed by com- 
paring its present location with that of the master SMPTE timecode. As a result, 
it's often far easier and less expensive to set the analog device as the master, 
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especially if the slave device is a DAW or other digital device. When starting a 
new session, the course of action is to stripe the highest track on a clean roll of 
tape (with ascending code that continúes from the tape's beginning to its end). 
Once done, the reproduced code can be routed to the SMPTE input on your 
MIDI interface or synchronizer. Ifyou don't have a multiport interface or ifyour 
interface doesn't have a SMPTE input, you'll need to get hold of a box that 
converts SMPTE to MTC which can then be plugged into a MIDI In port for 
assignment to a DAW or sequencer device. 

VCRs 

Since video is often an extremely stable timing source, an analog or digital 
machine should almost invariably act as a system master. In fact, without expen- 
sive hardware, an analog VCR can't easily be set to act as a slave, since the various 
sync references within the machine would be thrown off and the picture would 
immediately break up or begin to roll. From a practical standpoint, locking 
other devices to a standard VCR is done in much the same way as is done with 
an analog tape machine. Professional video decks generally inelude a sepárate 
track that's dedicated to timecode (in addition to other tracks that are dedicated 
to audio). As with the earlier analog scenario, the master timecode track must 
be striped with SMPTE before beginning the project. This process shouldn't be 
taken lightly, because the timecode must conform to the timecode addresses on 
the original video master or working copy (see the earlier discussion of jam 
sync). Basically, the rule of thumb is: If you're working on a project that was 
created out of house, always use the code that was provided by the original 
production team. Striping your own code or erasing over the original code with 
your own would render the timing elements useless, because the new code 
wouldnT relate to the original addresses or inelude any variations that might 
be a part of the original master source. In short, make sure that your working 
copy ineludes SMPTE that's a regenerated copy of the original code! Should you 
overlook this, you can expect to run into timing and sync troubles, either imme¬ 
diately or later in the postproduction phase, while putting the music or dialog 
back together with the final video master—factors that will definitely lead to 
premature hair and client loss. 

Software applications 

In general, a MIDI sequencer will be programmed to act as a slave device. This 
is due to the fact that a digital sequencer can easily chase a master MIDI time¬ 
code (MTC) source and lock to a location point within a production with 
extreme ease (i.e., since MIDI is digital in nature and has low data overhead 
requirements, locating to any point within a sequence is technologically easy 
to accomplish). 

Digital audio workstations 

A computer-based DAW can often be set to act as either a master or slave. This 
will ultimately depend on the software and the situation, because most profes- 
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sional workstations can be set to chase (or be triggered by) a master timecode 
source or can generate timecode (often in the form of MIDI timecode). 

DAW support for video and picture sync 

Most modern DAWs inelude support for displaying a video track within a 
session, both as a video window that can be displayed on the monitor desktop 
and in the form of a video thumbnail track that will often appear in the track 
view as a linear guide track. Both of these provide important visual cues for 
tracking live music, sequencing MIDI tracks and accurately placing automation 
moves and effeets (sfx) at specific hitpoints within the scene (Figure 11.13). This 
feature allows audio to be built up within a DAW environment without the 
need to sync to an external devi ce at all. It's easily conceivable that through the 
use of recorded tracks, software instruments and internal mixing capabilities, 
tracks could be built up, spotted and mixed—all inside the box. 

Routing timecode to and f rom your Computer 

From a connections standpoint, most DAW, MIDI and audio application soft¬ 
ware packages are flexible enough to let you choose from any number of avail- 
able sync sources (whether connected to a hardware port, MIDI interface port 



FIGURE 11.13 

Most high-end DAW 
systems are capable 
of importing a 
videofile directly into 
the project session 
window. (Courtesy 
of Steinberg Media 
Technologies GMBH, 
www.steinberg.net.) 





























































Real-World Sync Applications 


FIGURE 11.14 

Cubase/Nuendo Sync 
Preferences dialog 
box. (Courtesy of 
Steinberg Media 
Technologies GMBH, 
www.steinberg.net.) 



or virtual sync driver). All you have to do is assign all of the slaves within the 
system to the device driver that's generating the system's master code (Figure 
11.14). In most cases, where the digital audio and MIDI sequencing applications 
are operating within the same Computer, it's best to have your DAW or editor 
generate the master code for the system. From time to time, you might run into 
an application or editor that's unable to generate timecode in any form. When 
faced with such an all-slave software environment, you'll actually need a phys- 
ical timecode master that can be routed to your editor, MIDI sequencer, etc. The 
easiest solution to this dilemma is to use an audio interface or multiport MIDI 
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interface that ineludes a software applet for generating timecode. In such a 
situation, all you need to do is to select the interfaces sync driver as your 
sync source for all slave applications. Pressing the Generate SMPTE button in 
the interface's application window or from its front panel will lock the software 
to the generated code, beginning at 00:00:00:00 or at any specified offset 
address. 

As more and more DAWs and digital mixing boards become locked to SMPTE/ 
MTC, the issue of locking the wordclock of a digital device directly to the SMPTE 
timecode stream will become more and more important. For example, slaving 
a DAW to external timecode under a full timecode lock (as opposed to a trig- 
gered free-run start) will usually require that specialized sync hardware be used 
to maintain a frame-by-frame lock. 


KEEPING OUT OF TROUBLE 

Here are a few guidelines that can help save your butt when using SMPTE and 
other timecode translations during a project: 

■ Familiarize yourself with the hardware and software involved in a project 
BEFORE the session starts. 

■ When in doubt about frame rates, special requirements or anything else, 
for that matter ... ask! You (and your client) will be glad you did. 

■ Fully document your timecode settings, offsets, start times, etc. 

■ If the project isn't to be used in-house, ask the producer what the proper 
frame rate should be. Don't assume or guess it. 

■ When beginning a new session (when using a tape-based device), always 
stripe the master contiguously from the beginning to end before the 
session begins. It never hurts to stripe an extra tape, just in case. This goes 
for both analog and digital devices. 

■ Start generating new code at a point before midnight (i.e., 01:00:00:00 
or 00:01:00:00 to allow for a preroll). If the project isn't to be used 
in-house, ask the producer what the start times should be. Don't assume 
or guess it. 

■ Never dub (copy) timecode directly. Always make a refreshed (jam 
synched) copy of the original timecode (from an analog master) before 
the session begins. 

■ Never use slow videotape speeds. In EP mode, a VHS deck runs too slowly 
to record or reproduce a reliable code. 

■ Disable noise reduction on analog audio tracks (on both audio and video 
decks). 

■ Work with copies of the original production video, and make a new one 
when sync troubles appear. 

■ It's not unusual for the timecode to be read incorrectly (when short drop- 
outs occur on the track, usually on videotape). When this happens, you 
might set the synchronizer to freewheel once the transports have initially 
locked. 


Keeping Out of Trouble 


In closing, I'd like to point out that synchronization can be a simple procedure 
or it can be a fairly complex one, depending on your requirements and the type 
of equipment that's involved. A number of books and articles have been written 
on this subject. If you're serious about production, I suggest that you do your 
best to keep up on it. Although the fundamentáis stay the same, new technolo- 
gies and techniques are constantly emerging. As always, the best way to learn 
is simply by reading and then jumping in and doing it. 


CHAPTER12 


Amplifiers 


In the world of audio, amplifiers have many applications. They can be designed 
to amplify, equalize, combine, distribute or isolate a signal. They can even be 
used to match signal impedances between devices. At the heart of any amplifier 
(i amp ) system is either a vacuum tube or a semiconductor-type transistor series 
of devices. Everyone has heard of these regulating devices, but few have a grasp 
of how they opérate, so let's have a basic look into these electronic wonders. 

AMPLIFICATION 

To best understand how the theoretical process of amplification works, let's 
draw on an analogy. The original term for the tube used in early amplifiers is 
valve (a term that's still used in England and other Commonwealth countries). 
If we hook up a physical valve to a high-pressure water hose, large amounts of 
water pressure can be controlled with very little effort, simply by turning the 
valve (Figures 12.1a and 12.1b). llsing a small amount of expended energy, a 
trickle of water can be turned into a high-powered gusher and back again. In 
practice, both the vacuum tube and the transistor work much like this valve. For 
example, a vacuum tube operates by placing a DC current across its píate and 
a heated cathode element (Figure 12.2). A wire mesh grid separating these two 
elements acts like a control valve, allowing electrons to pass from the píate to 
the cathode. By introducing a small and varying signal at the input onto the 
tube's grid, a much larger electrical signal can be used to correspondingly regú¬ 
late the flow of electrons between the píate and the cathode (Figure 12.3). 

Although the transistor (a term originally derived from "trans-resistor" meaning 
a device that can easily change resistance) operates under a different electrical 
principie than a tube-based amp, the valve analogy still applies. Figure 12.4 
shows a basic amplifier schematic with a DC power source that's placed across 
the transistor's collector and emitter points. As with the valve analogy, by pre- 
senting a small control signal at the transistor's base, the resistance between the 
collector and emitter will correspondingly change. This allows a much larger 
analogous signal to be passed to the device's output. 

As a device, the transistor isn't inherently linear; that is, applying an input signal 
to the base won't always produce a corresponding output change. The linear 
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FIGURE 12.1 

The current through 
a vacuum tube 
or transistor is 
controlled in a 
manner that’s similar 
to the way that a 
valve tap can control 
water pressure 
through a water pipe: 

(a) open valve; 

(b) closed valve. 




FIGURE 12.2 

An example of a 
triode vacuum tube. 


grid 

(valve) 


operating región of a transistor lies between the 
device's lower-end cutoff región and an upper 
saturation point (Figure 12.5). Within this 
operating región, however, changes at the input 
will produce a corresponding (linear) change in 
the collector's output signal. When operating 
near these cutoff or saturation points, the base 
current lines won't be linear and the output will 
be distorted. In order to keep the signal within 
this linear operating range, a DC bias voltage 
signal is applied to the base of the transistor 
(for much the same reason a high-frequency 
bias signal is applied to an analog recording 
head). After a corrective voltage has been 
applied and sufficient amplifier design characteristics have been met, the amp's 
dynamic range will be limited by only two factors: noise (which results from 
thermal electrón movement within the transistor and other circuitry) and 
saturation. 



Amplifier saturation results when the input signal is so large that its DC output 
supply isn't large enough to produce the required, corresponding output signal. 
Overdriving an amp in such a way will cause a mild to severe waveform distor- 
tion effect known as clipping (Figure 12.6). For example, if an amp having a 
supply voltage of +24 volts (V) is operating at a gain ratio of 30:1, an input 


























































Amplifiers CHAPTER12 


Power Supply 




current 



FIGURE 12.3 

A schematic showing 
how small changes 
in voltage at the 
tube’s grid can 
produce much larger, 
corresponding 
amplitude changes 
between its cathode 
and píate. 


FIGURE 12.4 

A simple schematic 
showing how small 
changes in current at 
the transistor’s base 
can produce much 
larger, corresponding 
amplitude changes 
through the emitter 
and collector to the 
output. 


signal of 0.5 V will produce an output of 15 V. 
Should the input be raised to 1 V, the required 
output level would have to be increased to 30 V. 
However, since the máximum output voltage is 
limited to 24 V, levels above this point will be 
chopped off or "clipped" at the upper and lower 
edges of the waveform. Whenever a transistor and 
integrated Circuit design clips, severe odd-order har- 
monics are often introduced that are immediately 
audible. Tube amp designs, on the other hand, tend 
to lend a more musical sounding, even-order har- 
monic aspect to a clipped signal. I'm sure you're 
aware that clipping distortion can be a sought-after 
part of an instrument's sound (electric guitars thrive 
on it); however, it's rarely a desirable effect in quality 
studio and monitoring gear. The best way to avoid 
undesirable distortion is to be aware of the various 
amp and device gain stages throughout the studio's 
signal chains. 



FIGURE 12.5 

Operating región of a transistor. 












































The Operational Amplifier 
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FIGURE 12.6 


THE OPERATIONAL AMPLIFIER 

An operational amplifier ( op-amp ) is a stable, high-gain, 
high-bandwidth amp that has a high-input impedance 
and a low-output impedance. These qualities allow 
op-amps (Figure 12.7) to be used as a basic building block 
for a wide variety of audio and video applications, simply 
by adding components onto the basic Circuit in a build- 
ing-block fashion to fit the design's needs. To reduce an 
op-amp's output gain to more stable, workable levels, a 
negative feedback loop is often required. Negative feedback 
is a technique that applies a portion of the output signal 
through a limiting resistor back into the negative or phase- 
inverted input terminal. By feeding a portion of the amp's 
output back into the input out of phase, the device's 
output signal level is reduced. This has the effect of con- 
trolling the gain (by varying the negative resistor valué) in 
a way that also serves to stabilize the amp and further 
reduce distortion. 

PREAMPLIFIERS 


A clipped waveform. One of the mainstay amplifier types found at the input 

section of most professional mixer, consolé and outboard 
devices is the preamplifier (preamp). This amp type is often 


Negative Feedback 



used in a wide range of applications, such as 
boosting a mic's signal to line level, providing 
variable gain for various signal types and isolat- 
ing input signáis and equalization, just to ñame 
a few. Preamps are an important component in 
audio engineering because they often set the 
"tone" of how a device or system will sound. 
Just as a microphone has its own sonic charac- 
ter, a preamp design will often have its own 
"sound." Questions such as "Are the op-amps 
designed from quality components?" "Do they 
use tubes or transistors?" "Are they quiet or 
noisy?" are important considerations that can 
greatly affect the overall sound of a device. 


FIGURE 12.7 

Basic op-amp EQUALIZERS 

configuration. _ . „ 7 . . . . r ,. . . 

Basically, an equalizer is nothmg more than a rrequency-discrimmatmg ampli- 

fier. In most analog designs, equalization (EQ) is achieved through the use of 

resistor/capacitor networks that are located in an op-amp's negative feedback 

loop (Figures 12.8 and 12.9) in order to boost (amplify) or cut (attenuate) 

certain frequencies in the audible spectrum. By changing the Circuit design, 

complexity and parameters, any number of EQ curves can be achieved. 
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FIGURE 12.8 

Low-frequency 
equalizer Circuit 


R1 R3 RV 



FIGURE 12.9 

High-frequency 
equalizer Circuit 


SUMMING AMPLIFIERS 

A summing amp (also known as an active combining amplifier) is designed to 
combine any number of discrete inputs into a single output signal bus, while 
providing a high degree of isolation between them (Figure 12.10). The summing 
amplifier is an important component in analog console/mixer design because 
the large number of internal signal paths require a high-degree of isolation in 
order to prevent signáis from inadvertently leaking into other audio paths. 

DISTRIBUTION AMPLIFIERS 

Often, it's necessary for audio signáis to be distributed from one device to 
several other devices or signal paths within a recording consolé or music studio. 
In this situation, a distribution amp isn't used to provide gain but instead will 
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FIGURE 12.10 

A summing amp is 
used to provide 
isolation between 
various inputs and/or 
outputs in a signal 
Chain. 



FIGURE 12.11 

Distribution amp. 
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amplify the signal's current (power) that's being delivered to one or more loads 
(Figure 12.11). Such an amp, for example, could be used to boost the overall 
signal power so that a single feed could be distributed to a large number of 
headphones during a string or ensemble session. 

POWER AMPLIFIERS 

As you might expect, power amplifiers (Figures 12.12 and 12.13) are used to boost 
the audio output to a level that can drive one or more loudspeakers at their 
rated volume levels. Although these are often reliable devices, power amp 
designs have their own special set of problems. These inelude the fact that tran¬ 
sistor don't like to work at the high temperatures that can be generated during 
continuous, high-level operation. Such temperatures can also result in changes 
in the unit's response and distortion characteristics ... or outright failure. This 
often requires that protective measures (such as fuse and thermal protection) 
be taken. Fortunately, many of the newer amplifier models offer protection 
under a wide range of Circuit conditions (such as load shorts, mismatched loads 
and even open "no-load' circuits) and are usually designed to work with speaker 
impedance loads ranging between 4 and 16 ohms (most speaker models are 
designed to present a nominal load of 8 ohms). When matching an amp to a 
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FIGURE 12.12 

QSC Audio’s RMX 
1850HD professional 
power amplifier: 

(a) front panel; 

(b) rear panel. 
(Courtesy of QSC 
Audio Products, Inc., 
www.qscaudio.com.) 



FIGURE 12.13 

Bryston 9B SST 
professional 
5-channel amplifier. 
(Courtesy of Bryston 
Ltd., www.bryston. 
ca.) 


speaker set, the amp should be capable of delivering sufficient power to properly 
drive the speakers. If the speaker's sensitivity rating is too low or the power 
rating too high for what the amp can deliver, there could be a tendency to 
"overdrive" the amp at levels that could cause the signal to be clipped. In addi- 
tion to sounding distorted, clipped signáis can contain a high-level DC compo- 
nent that could potentially damage the speaker's voice coil drivers. 

VOLTAGE- AND DIGITALLY 
CONTROLLED AMPLIFIERS 

Up to this point, our discussion has largely focused on analog amps whose 
output levels are directly proportional to the signal level that's present at its 











































Voltage- and Digitally Controlled Amplifiers 


Out 


input. Several exceptions to this principie are the voltage- 
controlled amplifier (VCA) and the digitally-controlled amplifier 
(DCA). In the case of the VCA, the overall output gain is a 
function of an external DC voltage (generally ranging from 0 
to 5 V) that's applied to the device's control input (Figure 
12.14). As the control voltage is increased, the analog 
signal will be proportionately attenuated. Likewise, a digitally 
controlled external voltage can be used to control the amp's 
FIGURE 12.14 overall gain. Certain older consolé automation systems, auto- 

Simplified example of mated analog signal processors and even newer digital consolé designs make 

a use of VCA technology to digitally store and automate levels. 

amplifier. 



With the wide acceptance of digital technology in the production studio, it's 
now far more common to find devices that use digitally controlled amplifiers 
to control the gain of an audio signal. Although most digital devices change the 
gain of a signal directly within the digital domain, it's also possible to change 
the gain of an analog signal using an external digital source. Much like the VCA, 
the overall gain of an analog amp can be altered by placing a series of digitally 
controlled step resistors into its negative feedback loop and digitally varying the 
amount of resistance that's required to achieve the desired gain. 








CHAPTER13 


The Art and Technology 
of Mixing 


In the past, almost all commercial music was mixed by a professional recording 
engineer under the supervisión of a producer and/or artist. Although this is still 
true at many levels, with the emergence of the project studio, the vast majority 
of production facilities have become much more personal and cost effective in 
nature. Additionally, with the maturation of the digital revolution, individuáis, 
small businesses and artists are taking the time to become experienced in the 
commonsense rules of Creative and commercial mixing in their own production 
workspaces. 

Within the music production industry, it's a well-known fact that most profes¬ 
sional mixers have to earn their "ears" by logging countless hours behind the 
consolé. Although there's no substitute for this experience, the mixing abilities 
and ears of producers and musicians outside of the pro studio environment are 
also steadily improving as equipment quality gets better and as practitioners 
become more knowledgeable about proper mixing environments and tech- 
niques—often by mixing their own compositions. 

In this chapter, we'll be taking a basic look at the technology and art of mixing, 
and will gain insights into how the consolé, mixer, digital audio workstation 
and modern production equipment can work together to improve both your 
techniques and your sound. 

The basic purpose of an audio production consolé or mixer (Figures 13.1 through 
13.3) is to give us full control over volume, tone, blending and spatial position- 
ing for any or all signáis that are applied to its inputs from microphones, elec- 
tronic instruments, effects devices, recording systems and other audio devices. 
An audio production consolé (which also goes by the ñame of board , desk or 
mixer ) should also provide a straightforward way to quickly and reliably route 
these signáis to any appropriate device in the studio or control room so they 
can be recorded, monitored and/or mixed into a final product. A consolé or 
mixer can be likened to an artist's palette in that it provides a Creative control 
surface that allows an engineer to experiment and blend all the possible vari¬ 
ables onto a sonic canvas. 
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FIGURE 13.1 

Solid State Logic 
Duality consolé 
at Mono Music, 
Sweden. (Courtesy 
of Solid State Logic, 
www.solid-state- 
logic.com.) 



FIGURE 13.2 

Digidesign ICON 
Integrated consolé. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 



THE RECORDING PROCESS 

Before multitrack recorders carne onto the production scene, all the sounds and 
effects of a recording were mixed together at one time during a live performance. 
If the recorded blend (or mix, as it's called) wasn't satisfactory or if one musician 
made a mistake, the selection had to be performed over until the desired balance 
and performance was obtained. However, with the introduction of multitrack 
recording, the production phase of a modern recording session has radically 
changed into one that generally involves three stages: 
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FIGURE 13.3 

Mackie 8-bus analog 
consolé with 
expansión sidecars. 
(Courtesy of Loud 
Technologies Inc., 
www.mackie.com.) 


■ Recording 

■ Overdubbing 

■ Mixdown. 

Recording 

The recording phase involves the physical process of capturing live or sequenced 
instruments onto a recorded médium (disk, tape or whatever). Logistically, this 
process can be carried out in a number of ways: 

■ All the instruments to be used in a song can be recorded in one live pass. 

■ Electronic instruments, which were previously arranged and sequenced to 
form the basic foundation of a song, can be recorded onto the various 
tracks of a recorder or digital audio workstation (DAW) in such a way that 
other live instruments, vocal tracks and so on, can be added at a later time. 

■ Live musicians can be used to lay down the basic foundation tracks 
(usually rhythm) of a song. 

■ Instruments, vocals, etc., can then be added one at a time or in small 
groups at a later time during the overdub phase. 

The last two of these procedures are the most commonly encountered in the 
recording of popular music. The resulting foundation tracks (to which other 
tracks can be added at a later time) are called basic, rhythm or bed tracks. These 
consist of instruments that provide the rhythmic foundations of a song and 
often inelude drums, bass, rhythm guitar and keyboards (or any combination 
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FIGURE 13.4 

When recording 
popular music, each 
instrument is 
generally recorded 
onto a sepárate track 
(or stereo tracks) of a 
recording device. 



thereof). An optional vocal guide (scratch track) can also be recorded at this 
time to help the musicians and vocalists capture the proper tempo and that 
all-important feel of a song. 

When recording popular music, each instrument is generally recorded onto 
sepárate tracks of a tape or DAW recorder (Figure 13.4). This is accomplished 
by plugging each mic into an input strip on the consolé (either directly into the 
mixer or audio interface itself or into an appropriate input on a mic panel that's 
located in the studio), setting the gain throughout the input strip's signal path 
to its optimum level and then assigning each signal to an appropriate consolé 
output, which is finally routed to a desired channel/track on the recording 
device. Although monitoring is important during this phase, the beauty behind 
this process is that the final volume, effects and placement changes can be made 
at a later time—during the mixdown stage. 

When recording to an analog recorder (as most DAWs have fewer track limita- 
tions), an instrument or group of instruments that require múltiple mies can 
be recorded onto a single track (or a stereo pair of tracks) by assigning the 
various input strips on a consolé or mixer to the same output bus (in a process 
known as grouping). These combined signáis can then be balanced in level, 
panned, equalized and processed by monitoring the grouped outputs at the 
consolé or recording device (Figure 13.5). Whenever múltiple sources are 
recorded in a grouped fashion onto a single track or tracks, a great deal of care 
should be taken when setting the various volumes, tonalities and placements. 
As a general rule, it's far more difficult to make changes to recorded grouped 
tracks, because changes to one instrument will almost always directly affect the 
overall group mix. 
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FIGURE 13.5 

Several instruments 
can be “grouped” 
onto a single track 
(or stereo pair of 
tracks) by assigning 
each signal’s input 
strip to the same 
consolé output bus. 


In the conventional "one-instrument-per-track" setting, each signal should be 
recorded at a reasonably high level without overloading the analog or digital 
track. When recording to tape, recording at the highest level will result in an 
optimum signal-to-noise ratio, so that tape hiss, preamp noise or other artifacts 
won't impair the final product. Digital tracks, on the other hand, are more 
forgiving (due to the increased headroom, especially at higher bit rates), so it's 
always a good idea to record signáis to tape or disk at recommended levels 
(often peaking at 12 dB below the máximum overload point). 

The recording process is vitally important to the outcome of the overall project. 
The ñame of the game is to capture the best performance with the highest pos- 
sible quality, and at optimum signal levels (often without regard to level bal¬ 
ances on other tracks). In other words, a chanted whisper from a vocalist might 
easily be boosted to recorded levels that are equal to those of the electric guitar 
(which has hopefully been isolated away in a sepárate room or area). From this, 
you might guess that the process of creating a monitor mix of these unbalanced 
signáis is extremely important. 

When you ask producers about the importance of preparation, they will most 
likely place it near the top of the list for capturing a project's sound, feel and 
performance. The rhythm tracks (i.e., drums, guitar, bass and possibly piano) 
are often the driving backbone of a popular song, and recording them improp- 
erly can definitely get the project off to a bad start. Beyond making sure that 
the musicians are properly prepared and that the instruments are tuned and in 
top form (both of these being primarily the producer and/or band's job), it's 
the engineer's job to help capture a project's sound to disk or tape during 
the recording phase. Recording the best possible sound (both musically and 
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technically), without having to excessively rely on the "fix it in the mix" approach, 
will definitely start the project off on the right track. 

MONITORING 

Since the instruments have been recorded at levels that probably won't relate 
to the program's final balance, a sepárate mix must be made in order for the 
artists, producer and engineer to hear the instruments in their proper musical 
perspective ... for this, a sepárate mix is often set up for monitoring. As you'll 
learn later in this chapter, a multitrack performance can be monitored in several 
ways. No particular method is right or wrong; rather, it's best to choose a 
method that matches your own personal production style. No matter which 
monitoring style is chosen, the overall result will generally be as follows: 

■ When using a consolé or mixer during the recording process, each signal 
being fed to a track on the DAW or multitrack recorder will be fed to a 
studio monitor mix section (Figure 13.6). This submixer section is used 
to mix the individual inputs and instrument groups (with regard to level, 
panning, effects, etc.) into a musical balance that's then fed to the control 
room's main monitor speakers. Note that the program's overall balance 
can be adjusted without fear of changing the record levels—a feature that 
greatly simplifies the control room monitoring process. 

■ When using a consolé or DAW, a sepárate monitor mix can often be 
created that can be heard by musicians in the studio over headphones. In 
fact, two or more sepárate "cue" mixes are often available (depending on 
each musician's individual listening needs). 

Overdubbing 

Instruments that aren't present during the original performance can be added 
at a later time to the existing multitrack project during a process known as 
overdubbing. At this stage, musicians listen to the previously recorded tracks over 


FIGURE 13.6 

During recording, 
each signal can be 
fed to the monitor 
mix section, where 
the various 
instruments can be 
mixed and then fed 
to the control room’s 
main speakers and/ 
or the performer’s 
headphones. 
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headphones and then play along in sync while recording new and sepárate 
tracks that need to be added to the basic tracks at a later time in order to finish 
the project. 

An entirely new "take" can be laid down to tape or disk if a musician has made 
minor mistakes during an otherwise good performance, or sections of a take 
can be "punched in" to correct a bad overdub section. These new performances 
are recorded in sync with the original performances and are laid onto new tracks 
on a workstation, or when using analog, unrecorded tracks or previously 
recorded tracks that are no longer needed. When overdubbing tracks onto an 
analog multitrack recorder, it's important to remember to place the tracks that 
are to be played back into the sync mode (a process whereby the record head 
is temporarily used as a playback head in order to maintain a proper time rela- 
tionship during playback). For more information on analog recording and sync 
playback, refer to Chapter 5. For more information on recording to a DAW, 
please refer to Chapter 7. 

Mixdown 

Once all the musical parts have been performed and recorded to everyone's 
satisfaction, the mixdown or mix stage can begin. At this point, playback outputs 
of a DAW or multitrack recorder are fed to the console's line inputs. When using 
a traditional consolé layout, this is done by switching the consolé to the 
mixdown mode or by changing the appropriate input switches to the Line or 
Tape position. Most virtual DAW mixers need no mode switching and are not 
only able to output a final production mix to the interface outputs, but can 
"export" the mix as an audiofile. 

Once the mixdown process is under way, the audio is repeatedly played while 
adjustments in level, panning, EQ, effects, etc., are made for each track and/or 
track grouping. Throughout this artistic process, the individu- 
ally recorded signáis are blended into a composite 
stereo, surround or mono signal that's fed to the 
master mixdown recorder (or internally mixed 
within a DAW's software mixer). When a number 
of mixes have been made and a single versión 
has been approved, this recording (called the 
master or final mix) can be mastered to its 
intended médium and/or assembled (along with 
other programs in the project) into a final product. 

UNDERSTANDING THE UNDERLYING CONCEPT OF 
“THE MIXING SURFACE” 

In order to understand the process of mixing, it's important to understand one 
of the most important concepts in all of audio technology: the signal chain (also 
known as the signal path ). As is true with literally any audio system, the record¬ 
ing consolé can be broken down into functional components that are chained 


REMEMBER 

It’s aluiays mise to save your 
individual mixes under a nein and 
identifiable ñame.... You never knom 
mhen you might mant to revert to 
a previous mix versión. 


Understanding the Underlying Concept of “The Mixing Surface” 


together into a larger (and hopefully manageable) number of signal paths. By 
identifying and examining the individual components that work together to 
form this chain, it becomes easier to understand the basic layout of any mixing 
system, no matter how large or complex. To better understand the layout of 
a mixer, let's start with the concept that it's built of numerous building- 
block components, each having an output (source) that moves to an input 
(destination). 

In such a signal flow chain, the output of each source device must be connected 
to the input of the device or section that follows it, and so on and so forth ... 
until the end of the audio path is reached. Whenever a link in this source-to- 
destination path is broken, no signal will pass. Although this "gozinta-gozouta" 
approach might seem like a simple concept, keeping it in mind can save your 
sanity and your butt when paths, devices and cables that look like tangled piles 
of spaghetti get out of hand. 

Let's start our quest for understanding by taking a conceptual look at various 
mixing systems—from the analog hardware mixer to the general layout within 
a virtual mixing environment. In a traditional hardware mixer (which also goes 
by the ñame of board, desk or consolé) design, the signal flow for each input 
travels vertically down a plug-in strip known as an I/O module (Figures 13.7 
and 13.8) in a manner that generally flows: 

■ From the input section 

■ Through a sends section (which taps ofif to external processing/monitoring 
devices) 

■ Into an equalizer (and other processing functions, such as dynamics) 


Input Send 



FIGURE 13.7 

General anatomy of 
an input strip on the 
Mackie 8-bus analog 
consolé. (Courtesy of 
Loud Technologies, 
Inc., www.mackie. 
com.) 
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(a) 

FIGURE 13.8 

Channel input section: (a) Mackie 8-bus (courtesy of Loud Technologies, Inc., www.mackie.com); (b) Neve 88RS 
(courtesy of AMS Neve pie, www.ams-neve.com); (c) Solid State Logic XL 9000 K (courtesy of Solid State Logic, www. 
solid-state-logic.com). 
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(c) 

FIGURE 13.8 

Continued 



□ □ □ □ 



■ Passing through a monitor mix section (which taps off 
to external monitoring devices) 

■ To an output fader that ineludes pan positioning 

■ Into a routing section that can send signal to selected 
mix/signal destinations. 

Although the layout of a traditional analog hardware mixer 
won't hold true for the graphical user interface (GUI) layout 
of all virtual DAW mixers, the signal flow will most likely 
follow along the same or similar paths. Therefore, grasping 
the concept of the signal chain is equally important in the 
virtual world. A DAW's virtual mixer is likewise built from 
numerous building-block components, each having an 
output (source) and an input (destination). The output of 
each source device must be virtually connected to the input 
of the device that follows it—and so on until the end of the 
audio path is reached. Again, as with all of audio technology, 
keeping this simple concept in mind is important when 
paths, plug-ins and virtual paths seem to meld together into 
a ball of confusión. When the going gets rough, slow down, 
take a deep breath, read the manual (if you have the time)— 
and above all be patient and keep your wits about you. 

Figures 13.9 through 13.11 show the general I/O stages of 
several virtual mixing systems. It's important that you take 
the time to familiarize yourself with the inner workings of 
your own DAW (or those that you might come in contact 
with) by reading the manual, pushing buttons and by diving 
in and having fun with your own projeets. 

So, now that we have a fundamental idea of how a hardware 
and software mixing system is laid out, let's discuss the 
various stages in greater detail as they flow through the pro- 
cess, starting with a channel's input, on through the various 
Processing and send stages and then out to the final mix 
destination. 


Channel input 

The channel input serves as a preamp section to optimize 
the signal gain levels at the input of an I/O module 
before the signal is processed and routed. On a hardware 
consolé, mixer or audio interface that ineludes mic preamps, 
either a mic or line input can be selected to be the signal 
source (Figure 13.12). Although these valúes vary between 
designs, mic trims are typically capable of boosting a signal 
over a range of +20 to +70 dB, while a line trim can be varied 
in gain over a range of-15 (15-dB pad) to +45 dB. Gain 
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trims are a necessary component in the signal 
path, because the output level of a microphone 
is typically very low (-45 to -55 dB) and requires 
that a high-quality, low-noise amp be used to 
raise and/or match the various mic levels in order 
for the signal to be passed throughout the consolé 
at an optimum level (as determined by the con- 
sole's design and standard operating levels). 

Whenever a mic or line signal is boosted to levels 
that cause the preamp's output to be overdriven, 
severe clipping distortion will almost certainly 
occur. To avoid the dreaded LED overload light, 
the input gain must be reduced (by simply 
turning down the gain trim or by inserting an 
attenuation pad into the Circuit). Conversely, 
signáis that are too low in level will unnecessarily 
add noise into the signal path. Finding the right 
levels is often a matter of knowing your equip- 
ment, watching the meter/overload displays and 
using your experience. 

Input attenuation pads that are used to reduce a 
signal by a specific amount (e.g., -10 or -20 dB) 
may be inserted ahead of the preamp, in order to 
prevent input overload. On many consoles, the 
preamp outputs may be phase-reversed, via the 
"b" button. This is used to change the signal's 
phase by 180° in order to compénsate for polar- 
ity problems in the mic cable or during place- 
ment. High- and low-pass filters may also follow 
the preamp, allowing such extraneous signáis as 
amp/tape hiss or subsonic floor rumble to be 
filtered out. 


Output Path 
selector 


Pan indicator 

Record Enable 
button 

Solo button 


Volume fader 
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— Inserís (up to 5) 


If the mixer or interface is so equipped, digital 
input signáis can be inserted into the signal path. 

If this is the case, care must be taken to ensure that word sync issues are 
addressed so as to reduce possible timing conflicts and distortion between the 
digital devices. (Further info on wordclock can be found in Chapter 6.) 

INSERT POINT 

Many mixer and certain audio interface designs provide a break in the signal 
chain that occurs after the channel input (although these insert points can be 
taken at other places within many hardware consolé designs). A direct send/ 
return or insert point (often referred to simply as direct or insert) can then be 
used to send the strip's line level audio signal out to an external processing or 


— Sends (up to 10) 
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Track comment area 


FIGURE 13.9 

Virtual mixer strip 
layout for Pro 
Tools. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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FIGURE 13.10 

Steinberg’s Cubase 
SX/Nuendo virtual 
mixer. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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recording device. The external device's output signal can then be inserted back 
into the signal path, where it can be routed to a destination or mixed back into 
the audio program. It's important to note that plugging a signal processor into 
an insert point will only affect the audio on that channel. 

Should you wish to affect a number of channels, the auxiliary send and group 
output section can be used to process a combined set of input channels. Phys- 
ically, the send and return insert jacks of a hardware mixer or audio interface 
can be accessed as two sepárate jacks on the top or back of a mixer/console 
(Figure 13.13a), as a single, stereo jack in the form of a tip-ring-sleeve (TRS) 
connector that carries the send, return and common signáis (as shown in Figure 
13.13b) or as access points on a console's patch bay. 

A number of mixers and consolé manufacturers (such as Mackie and Tascam) 
use unbalanced, stereo TRS jacks to directly insert a signal in several interesting 
ways (Figure 13.14; please consult the manual for your particular system's jack 
layout): 
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■ Inserting a mono send plug to the first click will connect the cable to 
the direct out signal, without interrupting the return signal path. 

■ Inserting a mono send plug all the way in will connect the cable to 
the direct out signal while interrupting the return signal path. 

■ Inserting a stereo TRS plug all the way in allows the cable to be used 
as its intended send/return loop. 

Within a workstation environment, inserís are extremely important in that 
they allow audio or MIDI processing/effects plug-ins to be directly inserted 
into the virtual path of that channel (Figure 13.15). Often, a workstation 
allows múltiple plug-ins to be inserted into a channel in a stacked fashion, 
allowing complex and unique effects to be built up. Of course, keep in mind 
that the extensive use of inserí plug-ins can eat up processing power. Should 
the stacking of múltiple plug-ins become a drain on your CPU (something 
that can be monitored by watching your processor usage meter—aka "busy 
bar"), many DAWs allow the track to be frozen, meaning that the total sum 
of the effects can be written to an audiofile, allowing the effects to be played 
back without causing undue strain on the CPU. 

Auxiliary send section 

In a hardware or virtual setting, the auxiliary (aux) sends are used to route 
and mix signáis from one or more input strips to the various effects output 
sends and monitor/headphone cue sends of a consolé. These sections are 
used to create a submix of any (or all) of the various consolé input signáis 
to a mono or stereo send, which can then be routed to any signal process¬ 
ing or signal destination (Figure 13.16). 

It's not uncommon for up to eight individual aux sends to be found on an 
input strip. An auxiliary send can serve many purposes. For example, one 
send could be used to drive a reverb unit, signal processor, etc., while 
another could be used to drive a speaker that's placed in that great sound- 
ing bathroom down the hall. A pair of sends (or a stereo send) could be 
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FIGURE 13.11 

Input strip for 
Reason’s Mixer 
14:2. (Courtesy of 
Propellerhead 
Software, www. 
propellerheads.se.) 





FIGURE 13.12 

Analog and interface 
input sections. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com, 
and Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 
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send to 
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send to 



return from 

(b) 


FIGURE 13.13 

Direct send/return signal paths. (a) Two jacks can be used to send signáis to and return signáis from an external 
device. (b) A single TRS (stereo) jack can be used to insert an external device into an input strip’s path. 


FIGURE 13.14 

Various insert 
positions for an 
unbalanced TRS 
send/return loop: 

(a) direct out with no 
signal interruption 

to a mixer’s main 
output path—insert 
only to first click; 

(b) direct out with an 
interruption to the 
mixer’s main output 
path—insert all the 
way to the second 
click; (c) for use as 
an insert loop (tip = 
send to external 
device, ring = return 
from external device, 
sleeve = common 
Circuit ground). 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com.) 
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Direct Channel Inserís 


Máster Aux Sends 



FIGURE 13.15 

An effects plug-in 
can be inserted into 
the virtual path of 
a DAW’s channel 
strip. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net, 
and Universal Audio, 
www.uaudio.com.) 



Send 1 
Send 2 

Send 3- 
Send 4 


used to provide a headphone mix for several musicians in the studio, while 
another send could feed a sepárate mix to the hard-of-hearing drummer. Hope- 
fully, you can see how a send can be used for virtually any signal routing or 
effects processing task that needs to be handled. How you deal with a send is 
up to you, your needs and your creativity. 


FIGURE 13.16 

Although a hardware 
mixer’s signal path 
generally flows 
vertically from top to 
bottom, an aux sends 
path flows in a 
horizontal fashion, in 
that the various 
channel signáis are 
mixed together to 
feed a mono or 
stereo send bus. The 
combined sum can 
then be effected 
(sent to another 
destination) or 
returned back to the 
main output bus. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com.) 


With regard to a workstation, using an aux send is a way of keeping the process¬ 
ing load on the CPU to a minimum (Figure 13.17). For example, let's say that 
we wanted to make wide use of a reverb plug-in that's generally known to be a 
CPU hog. Instead of separately plugging the reverb into a number of tracks as 
inserts, we can greatly save on processing power by plugging the reverb into 
an aux send bus. This lets us selectively route and mix audio signáis from any 
number of tracks and then send the summed signáis to a single plug-in that 
can then be mixed back into the master out bus. In effect, we've cut down on 
our power requirements by using a single plug-in to do the job of numerous 
plug-ins. 
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FIGURE 13.17 

An effects plug-in 
can be inserted into 
an effects send bus, 
allowing múltiple 
channels to share 
the same effects 
process. (Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net, 
and Universal Audio, 
www.uaudio.com.) 
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Master Aux Sends 



Hardware EQ 


FIGURE 13.18 

Equalizer examples. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com, 
and Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 



Software EQ 



Equalization 

The most common form of signal processing is equalization (EQ). The audio 
equalizer (Figure 13.18) is a devi ce or circuit that lets us control the relative 
amplitude of various frequencies within the audible bandwidth. Like the aux- 
iliary sends, it derives its feed directly from the channel input section. Put 
another way, it exercises tonal control over the harmonic or timbral content of 
a recorded sound. EQ may need to be applied to a single recorded channel, to 
a group of channels or to an entire program (often as a step in the mastering 
process) for any number of other reasons, including: 

■ To correct for specific problems in an instrument or in the recorded sound 
(possibly to restore a sound to its natural tone) 
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■ To overeóme deficiencies in the frequeney response of a mic or in the 
sound of an instrument 

■ To allow contrasting sounds from several instruments or recorded tracks 
to better blend together in a mix 

■ To alter a sound purely for musical or Creative 
reasons. 

When you get right down to it, EQ is all about 
compensating for deficiencies in a sound 
pickup, "shaping" the sound of an instrument 
so that it doesn't interfere with other instru¬ 
ments in a mix or reducing extraneous sounds 
that make their way into a track. To start our 
discussion on how to apply EQ, let's take a look at 
the "Good Rule," which States: 

Let's say that at some point in the "good" chain something falls short—like, a 
mic was placed in a bad spot for a particular instrument during a session that's 
still under way. LJsing this example, we now have two options. We can change 
the mic position and overdub the track or re-record the entire song—or, we can 
decide to compénsate by applying EQ. These choices represent an important 
philosophy that's held by many producers and engineers (including myself): 
Whenever possible, EQ should NOT be used as a bandage to doctor a session 
after it's been completed. By this I mean that it's often a good idea to correct a 
problem on the spot rather than rely on the hope that you can fix it later in the 
mix using EQ and other methods. 

Although it's usually better to deal with problems as they occur, it isn't always 
possible. When a track needs fixing after it's completed, EQ is a good option 
when: 

■ There's no time or money left to redo the track. 

■ The existing take was simply magical ... and too much feeling would be 
lost if the track were to be redone. 

■ You have no control over a track that's already been recorded during a 
previous session. 

When EQ is applied to a track, bus or signal, the whole idea is to take out the 
bad and leave the good. If you keep adding EQ to the signal, it'll degrade the 
gain structure and lead to a creeping up of volume. Thus, it's often a good idea 
to use EQ to take away a deficiency in the signal but not necessarily to boost 
the desirable part of the track (which would in effect serve to turn up the overall 
gain). Such a boost will often throw off a mix's overall balance and reduce its 
overall headroom. A couple of active examples would inelude: 

■ Reducing the high end on a bass guitar instead of boosting its primary 
bass notes 

■ Using a peak filter to pulí out the ring of a snare drum (a perfect example 
of a problem that should've been corrected during the session). 


THE "GOOD RULE" 

Good musician + good instrument 
+ good performance + good acoustics 
+ good mic + good placement 
= good sound. 
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FIGURE 13.19 


Using EQ might not always be the best course of action; for 
example, to bring out an upper presence on a recorded vocal track, 
it might be best to use a peak curve to slightly boost the upper 
midrange. Just like life, nothing's ever absolute. 

Dynamics section 

Many top-of-the-line analog consoles offer a dynamics section on 
each of their I/O modules (Figure 13.19). This allows individual 
signáis to be dynamically processed more easily, without the need 
to scrounge up tons of outboard devices. Often, a full complement 
of compression, limiting and expansión (including gating) is pro- 
vided. A complete explanation of dynamic control can be found 
in Chapter 14 (Signal Processing). 

Monitor section 

During the recording phase, since the audio signáis are commonly 
recorded to tape or DAW at their optimum levels (without regard 
to the relative musical balance on other tracks), a means for creat- 
ing a sepárate monitor mix in the control room is necessary in 
order to hear a musically balanced versión of the production. A 
sepárate monitor section (as well as an aux bus) can be used to 
provide control over each input's level, pan, effects, etc., as well as 
to route this mix to the control room's mono, stereo or surround 
speakers (Figure 13.20). The approach and techniques of monitor- 
ing tracks during a recording will often vary from mixer to mixer 
(as well as among individuáis). Again, no method is right or 
wrong compared to another. It simply depends on what type of 
equipment you're working with, and on your own personal 
working style. 


Dynamics section of 
the Solid State Logic 
SL 9000 K. (Courtesy 
of Solid State Logic, 
www.solid-state- 
logic.com.) 


Note that during the overdub and general production phases, this monitor 
phase can be easily passed over in favor of monitor mixing the signal levels 
directly at the main faders in a standard mixdown environment. The straight- 
forward system lets us pass go and collect $200 by setting up a rough mix all 
through the production phase, allowing us to finesse the mix during production. 
By the time the final mix rolls around, many or most of the "mix as you go" 
and automation kinks will have been worked out. 


IN-LINE MONITORING 

Most hardware consolé designs incorpórate an I/O small fader section that can 
be used to directly feed the recorded signal that's being fed to either the multi- 
track recorder or the monitor mixer (depending on its selected operating mode). 
In the standard monitor mix mode (Figure 13.21a), the small fader is used to 
adjust the monitor level for the associated recording track. In the flipped mode 
(Figure 13.21b), the small fader is used to control the signal level that's being 
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FIGURE 13.20 w 

Monitor mix sections: (a) Mackie 8-bus consolé (courtesy of Loud Technologies, Inc., www.mackie.com); (b) Solid State 
Logic XL 9000 K (courtesy of Solid State Logic, www.solid-state-logic.com); (c) software monitor section within the 
Cubase/Nuendo virtual mixer (courtesy of Steinberg Media Technologies GmbH, www.steinberg.net.) 
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(b) 

FIGURE 13.21 

Small-fader monitor 
modes: (a) standard 
monitor mode; 

(b) flipped monitor 
mode. 





sent to the recording device, while the larger, 
main fader is used to control the monitor mix 
levels. This function allows multitrack record 
levels (which aren't often changed during a 
session) to be located out of the way, while 
the more frequently used monitor levels are 
assigned to the larger, more accessible master 
faders. 

SEPARATE MONITOR SECTION 

Certain British consoles (particularly those of 
older design) incorpórate a sepárate mixing 
section that's dedicated specifically to the task 
of mixing the monitor feed. Generally located 
on the console's right-hand side (Figure 13.22), 
the inputs to this section are driven by the 
console's multitrack output and tape return 
buses, and offer level, pan, effects and "fold- 
back" (an older British word for headphone 
monitor control). During mixdown, this type 
of design has the distinct advantage of offering 
a large number of extra inputs that can be 
assigned to the main output buses during 
mixdown for use with effects returns, elec- 
tronic instrument inputs and so on. During a 
complex recording session, this monitoring 
approach will often require an extra amount 
of effort and concentration to avoid confusing 
the inputs that are being sent to tape or 
DAW with the corresponding return strips 
that are being used for monitoring (which is 
probably why this design style has fallen out 
of favor). 


DIRECT INSERTION MONITORING 

If a consolé doesn't have any of the preceding 
monitor facilities (or even if it does), a simple 
and effective third option is still available to 
the user. A system known as direct insertion 
monitoring makes use of the direct send/returns of each input strip to inserí 
the recording device directly into the input strip's signal path. LJsing this 
approach: 


■ The direct send for each of the associated tracks (which can be either 
before or after the EQ section) is routed to the associated track input on 
a multitrack recording device (Figure 13.23). 
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FIGURE 13.22 

Older style English 
consoles may have a 
sepárate monitor 
section (circled), 
which is driven 
by the console’s 
multitrack output 
and/or tape return 
buses. (Courtesy 
of Buttermilk 
Records, www. 
buttermilkrecords. 
com.) 


The return signal is then routed from the record¬ 
er's output back into the same input strip's 
return path (where it injects the signal back into 
the strip's effects send, pan and main fader sec- 
tions for monitoring). 
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With this approach, the input signal directly following 
the mic/line preamp will be fed to the recorder (with 
record levels being adjusted by the preamp's gain 
trim). The return path (from the recorder) is then fed 
back into the input strip's signal path so it can be 
mixed (along with volume, pan, effects, sends, etc.) 
without regard for the levels that are being recorded 
to tape, disk or other médium. Playing back the track 
or placing any track into the sync mode won't affect 
the overall monitor mix at all—because the recorder's 
outputs are already being used to drive the console's 
monitor mix signáis. Direct insertion monitoring can 
be used effectively on mixers and larger consoles alike, 

with the only potential drawback being the large number of patch cords that 
might be required to patch the inputs and outputs to and from a multitrack 
recorder. 

Channel fader 

Each input strip contains an associated channel fader (which determines the 
strip's bus output level) (Figures 13.24 and 13.25) and pan pot (which is often 
designed into or near the fader and determines the signal's left/right placement 



return from device into monitor path 



FIGURE 13.23 

A track can be easily 
recorded and 
monitored by directly 
inserting the 
recording device into 
a strip’s send and 
return signal path, 
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FIGURE 13.24 

Output fader section 
of the Solid State 
Logic SL 9000 K. 
(Courtesy of Solid 
State Logic, www. 
solid-state-logic. 
com.) 



in the stereo and/or surround field). Generally, 
this section ineludes a solo/mute feature, which 
performs the following functions: 

■ Solo: When pressed, the monitor outputs 
for all other channels will be muted, 
allowing the listener to monitor only the 
selected channel (or soloed channels) 
without affecting the multitrack or main 
stereo outputs during the recording or 
mixdown process. 

■ Mute: This function is basically the oppo- 
site of the solo button, in that when it 
is pressed the selected channel is cut or 
muted from the main and/or monitor 
outputs. 

Depending on the hardware mixer or audio 
interface design, the channel fader might be 
motorized, allowing automation moves to be 
recorded and played back in the physical motion 
of moving faders. In the case of some of the 
high-end consolé and audio interface/control¬ 
ler designs, a flip fader mode can be called up 
that literally reassigns the control of the monitor 
section's fader to that of the main channel fader. 
This "flip" allows the monitoring of levels 
during the recording process to be controlled 
from the larger, long-throw faders. In addition 
to swapping monitor/channel fader functions, 
certain audio interface/controller designs allow 
a number of functions such as panning, EQ, 
effeets sends, etc., to be swapped with the main 
fader, allowing these Controls to be finely tuned 
under motorized control. 

Output section 

In addition to the concept of the signal path as 
it follows through the chain, there's another 
important signal path concept that should be 
understood: output bus. From the above input 
strip discussion, we've seen that a channel's 
audio signal by and large follows a downward 
path from its top to the bottom; however, when 
we take the time to follow this path, it's easy to 
spot where audio is routed off the strip and 
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FIGURE 13.25 

Hardware and 
software pan pot 
configurations: 

(a) stereo pan left/ 
right control; 

(b) surround pan 
control. 
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FIGURE 13.26 

Example of an effects 
send bus, whereby 
múltiple inputs are 
mixed and routed 
to a master send 
output. 
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onto a horizontal output path. Conceptually, we can think of this path (or bus) 
as a single electrical conduit that runs the horizontal length of a consolé or 
mixer (Figure 13.26). Signáis can be inserted onto or routed off of this bus at 
múltiple points. 

Much like a city transit bus, this signal path follows a specific route and allows 
audio signáis to get on or off the line at any point along its path. Aux sends, 
monitor sends, channel assignments, and main outputs are all examples of 
signáis that are injected into buses for routing to one or more output destina- 
tions; for example, the aux send Controls are horizontally duplicated across a 
console's surface. These gain Controls are physically tied to an auxiliary send 
bus that routes the mixed levels to an output destination. The main stereo or 
surround buses feed to the channel faders and pan positioners and then onto 
the mixer's main output buses, which are combined with the various effects 
return signáis and routed to the monitor speakers and/or recording device. 

As you might expect, a number of newer consoles and mixer designs also 
inelude a built-in FireWire and USB2 audio interface that can directly route 
audio signáis to and from the hardware control/mixer surface in a straightfor- 
ward manner. 

8 Channel assignment 

After the channel output fader on a consolé or larger mixer, the signal is often 
routed to the strip's track assignment matrix (Figure 13.27), which is capable 
of distributing the signal to any or all tracks of a connected multitrack recorder. 
Although this section electrically follows either the main or small fader section 
(depending on the channel's monitor mode), the track assign buttons will often 
be located either at the top of the input strip or designed into the main output 
fader (often being placed at the faders right-hand side). Functionally, pressing 
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FIGURE 13.27 

Channel assignment section: (a) Mackie 8-bus (courtesy of Loud Technologies, Inc., www.mackie.com); (b) Neve 88RS 
(courtesy of AMS Neve pie, www.ams-neve.com); (c) Solid State Logic XL 9000 K (courtesy of Solid State Logic, www. 
solid-state-logic.com.) 
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any or all assignment buttons will route the input strip's main signal to the 
corresponding track output buses. For example, if a vocal mic is plugged into 
channel 14, the engineer might assign the signal to track 14 by pressing (you 
guessed it) the "14" button on the matrix. If a quick overdub on track 15 is also 
needed, all the engineer has to do is unpress the "14" button and reassign the 
signal to track 15. 

Many newer consoles offer only a single button for even- and odd-paired tracks, 
which can then be individually assigned by using the strip's main output pan 
pot. For example, pressing the button marked "5/6" and panning to the left 
routes the signal only to output bus 5, while panning to the right routes it to 
bus 6. This simple approach accomplishes two things: 

■ Fewer buttons need to be designed into the input strip (lowering produc- 
tion costs and reducing the number of moving parts). 

■ Panning instruments within a stereo soundfield and then assigning their 
outputs to a pair of tracks on the multitrack recorder can easily build up 
a stereo submix. 

1 Grouping 

Many consoles and professional mixing systems allow any number of input 
channels to be organized into groups. Such groupings allow the overall relative 
levels of a series of channels to be interlinked into organized groups according 
to instrument or scene change type. This important feature makes it possible 
for múltiple instruments to retain their relative level balance while offering 
control over their overall group level from a single fader or stereo fader pair. 
Individual group bus faders often have two functions. They: 

■ Vary the overall level of a grouped signal that's being sent to a recorded 
track. 

■ Vary the overall submix level of a grouped signal that's being routed to 
the master mix bus during mixdown. 

The obvious advantage to grouping channels is that it makes it possible to avoid 
the dreaded and unpredictable need to manually change each channel volume 
individually. Why try to move 20 faders when you can adjust their overall levels 
from just one? For example, the numerous tracks of a string ensemble and a 
drum mix could each be varied in relative level by assigning them to their own 
stereo or surround sound groupings and moving a single fader—ahhhh ... 
much easier! 

On an analog mixer, grouping is simply accomplished by assigning the various 
channels in the desired group to their own output bus (Figure 13.28). During 
mixdown, each instrument group bus can be summed into the main stereo or 
surround output through the use of pan pots or L/R assignment buttons. 

In the digital or workstation domain, the busing process is actually easier in 
that any number of channels can be grouped together by assigning their relative 
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FIGURE 13.28 

Simplified anatomy 
of the output section 
on the Mackie 8-bus 
analog consolé. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com.) 
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FIGURE 13.29 

Monitor level section 
for the Mackie 8-bus. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com.) 


fader level control to a single digital control signal. This level can then be con- 
trolled from a single fader or on-screen automation Controls. 

Monitor level section 

Most consolé and mixing systems inelude a central monitor section that Controls 
levels for the various monitoring functions (such as control room level, studio 
level, headphone levels and talkback). This section (Figure 13.29) often makes 
it possible to easily switch between múltiple speaker sets and can also provide 
switching between the various input and recording devi ce sources that are found 
in the studio (e.g., surround, stereo and mono output buses; tape returns; aux 
send monitoring; solo monitoring). 

Patch bay 

A patch bay (Figure 13.30) is a panel that's found on larger consoles which 
(under the best of conditions) contains accessible jacks that correspond to the 
various inputs and outputs of every access point within a mixer or recording 
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(a) 


FIGURE 13.30 

The patch bay: (a) 
Ultrapatch PX2000 
patch bay (courtesy 
of Behringer 
International GmbH, 
www.behringer.de); 
(b) rough example of 
a labeled patch bay 
layout. 



(b) 


consolé. Most professional patch bays (also known as patch panels) offer cen- 
tralized 1/0 access to most of the recording, effects and monitoring devices or 
system blocks within the production facility (as well as access points that can 
be used to connect between different production rooms). 

Patch bay systems come in a number of plug and jack types as well as wiring 
layouts. For example, prefabricated patch bays are available using tip-ring- 
sleeve (balanced), or tip-sleeve (unbalanced) 1/4-inch phone configurations, 
as well as RCA (phono) connections. These models will often place intercon- 
nected jacks at the panel's front and rear so that studio users can reconfigure 
the panel simply by rearranging the plugs at the rear access points. Other profes¬ 
sional systems using the professional telephone-type (TT or mini Bantam-TT) 
plugs often require that you hand-wire the connections in order to configure or 
reconfigure a bay (usually an amazing feat of patience, concentration and 
stamina). 

Patch jacks can be configured in a number of ways to allow for several signal 
connection options among inputs, outputs and external devices (Figure 13.31): 

■ Open : When no plugs are inserted, each 1/0 connection entering or leaving 
the panel is independent of the other and has no electrical connection. 

■ Half-normaled: When no plugs are inserted, each 1/0 connection entering 
the panel is electrically connected (with the input being routed to the 
output). When a jack is inserted into the top jack, the in/out connection 
is still intact, allowing you to tap into the signal path. When a jack is 
inserted into the bottom jack, the in/out connection is broken, allowing 
only the inserted signal to pass to the input. 

■ Normaled : When no plugs are inserted, each 1/0 connection entering the 
panel is electrically connected (with the input routing to the output). 
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FIGURE 13.31 

Typical patch bay 
signal routing 
schemes. (Courtesy 
of Behringer 
International GmbH, 
www.behringer.de.) 


When a jack is inserted into the top jack, the in/out connection is broken, 
allowing the output signal to pass to the cable. When a jack is inserted 
into the bottom jack, the in/out connection is broken, allowing the input 
signal to pass through the inserted cable connection. 

■ Parallel : In this mode, each I/O connection entering the panel is electrically 
connected (with the input routing to the output). When a jack is inserted 
into either the top or bottom jack, the in/out connection will still be 
intact, allowing you to tap into both the signal path's inputs and outputs. 

Breaking a normaled connection allows an engineer to patch different or addi- 
tional pieces of equipment into a circuit that's normally connected. For example, 
a limiter might be temporarily patched between a mic preamp output and an 
equalizer input. The same preamp/EQ patch point could be used to inserí an 
effect or other device type. These useful access points can also be used to bypass 
a defective component or to change a signal path order .... Versatility is defi- 
nitely the ñame of the game here! 

Metering 

The level of a signal's strength, at an input, output bus and other consolé level 
point, is often measured by visual meter display (Figure 13.32). Meter and 
indicator types will often vary from system to system. For example, banks of 
readouts that indicate consolé bus output and tape return levels might use VU 
metering, peak program meters (PPMs, found in European designs) or digital/ 
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FIGURE 13.32 

A set of LED, 
light-bar and VU 
meter displays. 
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software readouts. It's also not uncommon to find LED overload indicators on 
an input strip's preamp, which give quick and easy peak indications as to 
whether you've approached or reached the component's headroom limits (a 
sure sign to back off your levels). 

The basic rule regarding levels isn't nearly as rigid as you might think and will 
often vary depending on whether the device or recording médium is analog or 
digital. In short, if the signal level is too low, tape, amp and even digital noise 
could be a problem, because the levels throughout the signal chain will prob- 
ably not be optimized. If the level is too high, overloaded preamps, saturated 
tape or clipped digital converters will often result in a distorted signal. Here are 
a few rules of thumb: 

■ In analog recording, the proper recording level is achieved when the 
highest reading on the meter is near the zero level, although levels slightly 
above or below this might not be a problem, as shown in Figure 13.33. 
In fact, (slightly) overdriving some analog devices and tape machines will 
often result in a sound that's "rough" and "gutsy." 

■ When recording digitally, noise is often less of a practical concern (espe- 
cially when higher bit rates are used). It's often a good idea to keep levels 
at a reasonable level (i.e., peaking at about -12 on the meter), while 
keeping a respectful distance from the dreaded clip or "over" indicator. 
Llnlike analog, digital is usually very unforgiving of clipped signáis and 
will generate grunge that's guaranteed to make you eringe! Because there 
is no real standard for digital metering levels beyond these guidelines, 
many feel that giving a headroom margin that's 12 dB below "0" full scale 
is usually a wise precaution. 


It’s important to keep in mind that recoded distortion isn’t your ears might hear them after the damage is done, 
always easy (or possible) to fix. You might not be able to When in doubt, back off ... and give your levels some 
see the peak levels that could distort your soundfiles, but breathing room. 
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FIGURE 13.33 

VU meter readings 
for analog devices: 
(a) too low; (b) too 
high; (c) just right. 


(c) 
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In addition to the above options, a number of software plug-in metering systems 
are available for use with your DAW system. These displays will often provide 
a range of metering options that allow for a more accurate readout at any plug-in 
input or output stage. 

GAIN LEVEL OPTIMIZARON 

As we near the end of our discussion on consolé and mixer layouts, it's extremely 
important that we touch base on the concept of signal flow or gain level optimi- 
zation. In fact, the idea of optimizing levels as they pass from one functional 
block in an input strip to the next or from one device to another is one of the 
more important concepts to be grasped in order to create professional-quality 
recordings. Although it's possible to go into a great deal of math in this section, 
I feel that it's far more important that you understand the underlying principies 
of level optimization, internalize them in everyday practice and let common 
sense be your guide. For example, it's easy to see that, if a mic that's plugged 
into an input strip is overdriven to the point of distortion, the signal following 
down the entire path will be distorted. By the same notion, driving the mic 
preamp at too low a signal will require that it be excessively boosted at a later 
point in the chain, resulting in increased noise. From this, it follows that the 
best course of action is to optimize the signal levels at each point in the chain 
(regardless of whether the signal path is within an input strip, internal to a 
specific device, or pertains to I/O levels as they pass from one device to another 
throughout the studio as shown in Figure 13.34). 

From a practical standpoint, level adjustments usually begin at the mic preamp. 
Although many of these points are situational and even debatable, one straight- 
forward approach to setting proper gains on an input strip is to set the gain on 
the main strip fader (and possibly the master and group output faders) to 0 dB 
(unity gain). While monitoring levels for that channel or channel grouping, 
turn the mic preamp up until an acceptable gain is reached. Should the input 
overload LED light up, back off on the input level and adjust the output gain 
structure accordingly. Care should be taken when inserting devices into the 
signal chain at a direct insert point, making sure that the in and out signáis are 
at or near their optimum level. In addition, the EQ section can also cause level 
overload problems whenever a signal is overly boosted within a frequency range. 

Although this falls slightly outside of the scope of consolé design and operation, 
I'd like to briefly touch on the related topic of proper grounding and power 


FIGURE 13.34 

Gain stage chart 
showing the 
various level stages 
throughout the 
1202 vlz pro mixer. 
(Courtesy of Loud 
Technologies, Inc., 
www.mackie.com.) 
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conditioning issues, because system interconnection and the centralized concept 
of the consolé or audio interface often go hand-in-hand. 


POWER- AND GROUND-RELATED ISSUES 

Proper grounding is essential to maintaining equipment safety; however, within 
an audio facility, small AC voltage potentials between various devices in an 
audio system can leak into a system's grounding circuit. Although these poten¬ 
tials are small, they are sometimes large enough to induce noise in the form of 
hums, buzzes or radio-frequency (RF) reception that can be injected (and ampli- 
fied) directly into the audio signal path. These unwanted signáis generally occur 
whenever improper grounding allows a piece of audio equipment to detect two 
or more different paths to ground. 

Because grounding problems arise as a result of electrical interactions between 
any number of equipment combinations, the use of proper grounding tech- 
niques and troubleshooting within an audio production facility are by their very 
nature situational and often frustrating. As such, the following procedures are 
simply a set of introductory guidelines for dealing with this age-old problem. 
There are a great number of technical papers, books, methods and philosophies 
on grounding, and it's recommended that you carefully research the subject 
further befo re tackling any major ground-related problems. When in doubt, an 
experienced professional should be contacted, and care should be taken not to 
sacrifice safety. 

■ Keep all studio electronics on the same AC electrical circuit—most stray 
hums and buzzes occur whenever parts of the sound system are plugged 
into outlets from different AC circuits. Plugging into a circuit that's con- 
nected to such noise-generating devices as air conditioners, refrigerators, 
light dimmers, neón lights, etc., will definitely invite stray noise problems. 
Because most project studio devices don't require a lot of current (with 
the possible exception of power amplifiers), it's often safe to run all of the 
devices from a single, properly grounded line from the electrical circuit 
panel. 

■ Keep audio wiring away from AC wiring—whenever AC and audio cables 
are laid side-by-side, portions of the 60-Hz signal might be induced 
into a high-gain, unbalanced circuit as hum. If this occurs, check to see 
if separating or shielding the lines helps reduce the noise. When all 
else fails: 

■ If you only hear hum coming from a particular input channel, check that 
device for ground-related problems. If the noise still exists when the 
consolé or mixer inputs are turned down, check the amp circuit or any 
device that follows the mixer. If the problem continúes to elude you, 
then ... 

■ Disconnect all of the devices (both power and audio) from the consolé, 
mixer or audio interface, then methodically plug them back in one at a 
time (it's often helpful to monitor through a pair of headphones). 



Power- and Ground-Related Issues 


■ Check the cables for bad connections or improper polarity. It's also 
wise to keep the cables as short as possible (especially in an unbalanced 
Circuit). 

■ Another common path for ground loops is through a chassis into a 
19-inch rack that not only provides a mount for múltiple devices, but a 
common ground path. Test this by removing devices from the rack, one 
at a time. If needed, a device can be isolated from the rack by special nylon 
mounting washers. 

■ Investígate the use of a balanced power source, if traditional grounding 
methods don't work. 

Troubleshooting a ground-related problem can be tricky and finding the prob- 
lem's source might be a needle-in-a-haystack situation. When putting on your 
troubleshooting hat, it's usually best to remain calm, be methodical and consult 
with others who might know more than you do (or might simply have a fresh 
perspective). 

Balanced power 

For those facilities that are located in areas where power lines are overtasked by 
heavy machinery, air conditioners and the like, a balanced power source might 
be considered. Such a device makes use of a power transformer (Figure 13.35) 
that has two secondary windings, with a potential to ground on each side of 
60 V. Because each side of the Circuit is 180° out of phase with the other, a 
120-V supply is maintained. Also, since the two 60-V legs are out of phase, any 
hum, noise or RF that's present at the device's input will be canceled at the 
transforméis center tap (a nuil point that's tied to ground). 

A few important points relating to a balanced power circuit inelude: 

■ A balanced power circuit is able to reduce line noise if all of the system's 
gear is plugged into it. As a result, the device must be able to deliver 
adequate power. 

■ Balanced power will not elimínate noise from gear that's already sensitive 
to hums and buzzes. 


FIGURE 13.35 

Furman IT-1220 
balanced-output 
power conditioner. 
(Courtesy of Furman 
Sound, Inc., www. 
furmansound.com.) 
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■ When to use balanced power is often open to interpretation, depending 
on who you talk to. For example, some feel that a balanced power condi- 
tioner should be used only after all other options to elimínate noise have 
been explored, while others believe it is a starting point from which to 
build a noise-free environment. 

Power conditioning 

Whether your facility is a project studio or full-sized professional facility, it's 
often a wise idea to regúlate and/or isolate the voltage supply that's feeding one 
of your studio's most precious investments (besides you and your staff)—the 
equipment! This discussion of power conditioning can basically be broken 
down into three topics: 

■ Voltage regulation 

■ Eliminating power interruptions 

■ Keeping the lines quiet. 

In an ideal world, the power that's being fed to your studio outlets should be 
very cióse to the standard reference voltage of the country you are working in 
(e.g., 120 V, 220 V, 240 V). The real fact of the matter is that these line voltages 
regularly fluctuate from this standard level, resulting in voltage sags (a condition 
that can seriously underpower your equipment), surges (rises in voltage that 
can harm or reduce the working life of your equipment), transient spikes (sharp, 
high-level energy surges from lightning and other sources that can do serious 
damage) and brown-outs (long-term sags in the voltage lines). Through the use 
of a voltage regulator, high-level, short-term spikes and surge conditions can be 
clamped, thereby reducing or eliminating the chance that the main voltage will 
rise above a standard, predetermined level. 

Certain devices that are equipped with voltage regulation circuitry are able to 
deal with power sags, long-term surges and brown-outs by electronically switch- 
ing between the múltiple voltage level taps of a transformer so as to match the 
output voltage to the ideal main level (or as cióse to it as possible). One of the 
best approaches for regulating voltage fluctuations both above and below 
nominal power levels is to use an adequately powered uninterruptible power 
supply (UPS). In short, a quality UPS works by using a regulated power supply 
to constantly charge a rechargeable battery or bank of batteries. This battery 
supply is again regulated and used to feed sensitive studio equipment (such as 
a Computer, bank of effects devices, etc.) with a clean and constant voltage 
supply. 

DIGITAL CONSOLE AND DAW VIRTUAL 
MIXER TECHNOLOGY 

Consolé design, signal processing and signal routing technology have under- 
gone a tremendous design revolution with the advent of digital audio. Digital 
consoles, digital mixers, software DAW mixers and controller designs are finding 
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their ways into professional, project and audio production facilities at an 
amazing pace. These systems use large-scale integrated circuits and central pro- 
cessors to convert, process, route and interface to external audio and computer- 
related devices with relative ease. In addition, this technology makes it possible 
for many of the costly and potentially faulty discrete switches and level Controls 
that are required for such functions as track selection, gain and EQ to be 
replaced by assignable digital networks. The big bonus, however, is that since 
routing and other control functions are digitally encoded, it becomes a simple 
matter for level, patch and automation settings to be saved into memory for 
instantaneous recall at any time. 

From a functional standpoint, the basic signal flow of a digital consolé or 
mixer's input strip is similar to that of an analog consolé, in that the input must 
first be boosted in level by a mic/line preamp (where it will be converted into 
digital data). From this point, the signáis can pass through EQ, dynamics and 
other signal processing blocks; and various effects and monitor sends, volume, 
routing and other sections that you might expect, such as main and group 
output fader Controls. From a control standpoint, however, sections of a digital 
consolé might be laid out in a manner that's completely different. The main 
difference is a fundamental and philosophical change in the concept of the 
input strip itself. 

By the very nature of analog strip design, all of the Controls must be duplicated 
for each strip along the consolé. This is the biggest contributing factor to cost 
and reliability problems in traditional analog design. Since most (if not all) of 
the functions of a digital consolé pass through the devices central processor, 
this duplication is neither necessary ñor cost effective. As a result, designers have 
often opted to keep the most commonly used Controls (such as the pan, solo, 
mute and channel volume fader) in their traditional input strip position. 
However, Controls such as EQ, input signal processing, effects sends, monitor 
levels and (possibly) track assignment have been designed into a central control 
panel (Figure 13.36) that can be used to vary a particular channel's setting 
parameters. These Controls can be quickly and easily assigned to a particular 
input strip by pressing the "select" button on the relevant input strip channel. 

In certain designs, each input strip may be fitted with a virtual pot (V-pot) that 
can be assigned to a particular parameter for instant access, with position and 
level readouts being displayed via FEDs around the control. In others, the main 
parameter panel can be multipurpose in its operation, allowing itself to be 
reconfigured in a chameleon-like fashion to fit the task at hand. In such cases, 
touch-screen displays can be used to provide for an infinite degree of user 
control, while others might use software "soft" buttons that can easily reconfig¬ 
ure the form and function of the various buttons and dial Controls in the control 
panel. Finally, consolé parameters can be controlled by physically placing a 
readout display at their traditional control points on the input strip. Such a 
"knob per function" control/display system lets you simply grab the desired 
parameter and alter it (as though it were an analog consolé). Generally, this 
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FIGURE 13.36 

Centralized control 
panel for the Sony 
DMX-R100 digital 
consolé. (Courtesy of 
Sony Electronics, 

Inc., www.sony.com/ 
proaudio.) 


system is more expensive than those using a central control panel, because more 
readout indicators, control knobs and digital control interface Systems are 
required. The obvious advantage to this interface is that you'd have instant 
access (both physically and visually) to any and all of these parameters at once, 
in an analog fashion. 

From the above, it's easy to see that digital consoles often vary widely in their 
layout, ease of operation and cost effectiveness. As with any major system devi ce, 
it's often a good idea to familiarize yourself with the layout and basic functions 
before buying or taking on a project that involves such a beastie. As you might 
expect, operating manuals and basic tutorials are often available on the various 
manufacturers' websites. 


0 I Y 

do it yourself 


Tutorial: Channel Select 


1. Find yourself a digital mixer, consolé or DAW with a 
software mixer. (If one’s not nearby, you could visit a 
friend or a friendly studio or musió store.) 

2. Feed several tracks into it and create a simple mix. 


3. Press the channel select for the first input and 
experiment with the EQ and/or other Controls. 

4. Move on to the next channel and have fun building 
your mix. 


The increased power of the computer-based digital audio workstation has 
brought about a new, powerful and cost-effective audio mixing interface—the 
DAW software mixer (Figure 13.37). Through the use of a traditional (and 
sometimes not-so-traditional) user interfacing, these mixers offer functional 
on-screen control over levels, panning, EQ, effects, DSP, mix automation and a 
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FIGURE 13.37 

ProTools M-Powered 
on-screen mixer. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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host of functions that are too long to list here. Often these software consoles 
emulate their hardware counterparts by offering basic Controls (such as fader, 
solo, mute, select and routing) in the virtual input strip. Likewise, pressing the 
Select button will assign a multitude of virtual parameters on the central control 
panel to that strip. 

When using a software mixer in conjunction with a DAW's waveform/edit 
window on a single monitor screen, it's easy to feel squeezed by the lack of 
visual "real estáte." For this reason, many opt for a dual-monitor display arrange- 
ment. Whether you are working in a PC or Mac environment, this working 
arrangement is easier and more cost effective than you might think and the 
benefits will immediately become evident, no matter what audio, graphics or 
program environment you're working on. 

When using a DAW controller (Figure 13.38), the flexibility of the DAW's soft¬ 
ware environment can be combined with the hands-on control and motorized 
automation (if available) of a hardware controller surface. More information 
on the DAW software mixing environment and hardware controllers can be 
found in Chapter 7. 

Mixdown level and effects automation 

One of the great strengths of the digital age is how easily all of the mix and 
effects parameters can be automated and recalled within a mix. Although many 
large-scale consoles are capable of automation, computer-based DAW systems 
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FIGURE 13.38 

Mackie Universal 
Control. (Courtesy of 
Loud Technologies, 
Inc., www.mackie. 
com.) 


are particularly strong in this area, allowing complete control of all parameters 
throughout all phases of a production. The beauty of being able to write, read 
and recall a mix is that it allows us to build the mix over time, and even save 
múltiple mix versions, so that we can go back in time and explore other produc¬ 
tion avenues. In short, the job of mixing becomes much less of a chore, allow¬ 
ing us to pursue less of the technology of mixing ... and more of the art of 
mixing. What could be bad about that? 

Although terminologies and functional offerings will differ from one system to 
the next, control over the basic automation functions will be carried out in one 
of two operating modes (Figure 13.39): 

■ Write mode 

■ Read mode. 

Write mode 

Once the mixdown process has gotten under way, the process of writing the 
automation data into the system's memory can begin (actually, that's not entirely 
true, because basic mix moves will often begin during the recording or overdub 
phase). When in the write mode, the system will begin the process of encoding 
mix moves for the selected channel or channels in real time. This mode is used 
to record all of the settings and moves that are made on a selected strip or strips 
(allowing track mixes to be built up individually) or on all of the input strips 
(in essence, storing all of the mix moves, live and in one pass). The first 
approach can help us to focus all of our attention on a difficult or particularly 
important part or passage. Once that channel has been finessed, another channel 
or group of channels can then be written into the system's memory ... and then 
another, until an initial mix is built up. 

Often, modern automation systems will let us update previously written auto¬ 
mation data by simply grabbing the fader (either on-screen or on the consolé/ 
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FIGURE 13.39 

Automation mode 
selections: (a) Pro 
Tools showing auto 
selectors within 
the edit and mixer 
screens (courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com); (b) Cubase/ 
Nuendo showing 
auto selectors within 
the edit and mixer 
screens (courtesy 
of Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 
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controller) and moving it to the newly desired position. Once the updated move 
has been made, the automation will remain at that level or position until a 
previously written automation move is initiated, at which point the valúes will 
revert to the existing automation settings. 

Whenever data is updated over previously written automation on an older 
analog mixer or a DAW controller that isn't equipped with moving faders, 
the concept of matching current fader level or controller positions to their pre¬ 
viously written position becomes important. For example, let's say that 
we needed to redo several track moves that occur in the middle of a song. If 
the current controller positions don't match up with their previously written 
points, the mix levels could jump during the updated transition (and thus 
during playback). 

Read mode 

An automated consolé or DAW that has been placed into the read mode will 
play the mix information from the system's automation data, allowing the on- 
screen and moving faders to follow the written mix moves in real time. Once 
the final mix has been achieved, all you need to do is press play, sit back and 
listen to the mix. 

Drawn (rubberband) automation 

In addition to physically moving on-screen and controller faders under read/ 
write automation control, one of the most accurate ways to control various 
automation parameters on a DAW is through the drawing and editing of on- 
screen rubberbands. These useful tools offer a simple, graphic form of automa¬ 
tion that lets us draw fades and complicated mix automation moves over time. 
This user interface is so-named because the graphic lines (that represent the 
relative fade, volume, pan and other parameters) can be bent, stretched, and 
contorted like a rubberband (Figure 13.40). 

Commonly, all that's needed to define a new mix point is to click on a point 
on the rubberband (at which point a box handle will appear) and drag it to the 
desired position. You can change a move simply by clicking on an existing 
handle (or range of handles) and moving it to a new position. 

MIXING AND BALANCING BASICS 

Once all of the tracks of a project have been recorded, assembled and edited, 
it's time to put the above technology to use to mix your tracks into their final 
media forms. The goal of this process is to combine audio, MIDI and effects 
tracks into a pleasing form that makes use of such traditional tools as: 

■ Relative level 

■ Spatial positioning (the physical panned placement of a sound within a 
stereo or surround field) 

■ Equalization (affecting the relative frequency balance of a track) 


Mixing and Balancing Basles 




FIGURE 13.40 

Automation 
rubberbands: (a) Pro 
Tools (courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com); (b) Cubase/ 
Nuendo (courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 




■ Dynamics processing (altering the dynamic range of a track, group or 
output bus to optimize levels or to alter the dynamics of a track so it fits 
within a mix) 

■ Effects processing (adding reverb-, delay- or pitch-related effeets to a mix 
in order to augment or alter the piece in a way that is natural, unnatural 
or just plain interesting. 
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far field 


mid field 


front field 


Figure 13.41, for example, shows how sounds can be 
built up and placed into a sonic stage through the 
use of natural, psychoacoustic and processed signal 
cues to create a pleasing, interesting and balanced 
soundscape. Now, it's pretty evident that volume can 
be used to move sound forward and backward within 
the soundfield, and that relative channel levels can 
be used to position a sound within that field. It's less 
obvious that changes in timbre (often but not always 
introduced through EQ), delay and reverb can be 
used to move sounds within the stereo or surround 
field. All of this sounds simple enough; however, the 

dedication that's required to hone your skills within this evolving art is what 
mixing careers are made of. 

OK ... let's take a moment to walk through a fictitious mix. Remember, there's 
no right or wrong way to mix as long as you watch your levels along the signal 
path. There's no doubt that, over time, you'll develop your own sense of style. 
The important thing is to keep your ears open, care about the process and make 
it sound as good as you can. 



pan 


R 


FIGURE 13.41 

Graphic 

representaron of 
the stereo playback 
soundfield. 



Let's begin building the mix by setting the output volumes on each of the instru- 
ments to a level that's acceptable to your main mixer or consolé. From a practi- 
cal standpoint, you might want to set your tracks to unity gain or to some easily 
identifiable marking. 



The next step is to begin playing the project and changing the fader levels for 
any instrument or voice tracks until they begin to blend in the mix. Once that's 
done, you can sit back and listen to how the overall mix levels hold up over the 
course of the song. 



Should the mix need to be changed at any point from its initial settings, you 
might turn the automation on for that track and begin building up the mix. 
You might want to save your mix at various stages of its development under a 
different ñame (mymix 01, mybestmix 02, jimsnewapproach 03, etc.). This 
makes it easier to return to the point where you began to take a different path. 



You might want to group (or link) various instrument sections, so that overall 
levels can be automated. For example, during the bridge of a song you might 
want to reduce the volume on several tracks by simply grabbing a single fader 
... instead of moving each track individually. 
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This calis to mind a very important aspect of most types of production (actually 
it's a basic tenant in life): Keep it simple! If there's a trick you can use to make 
your project go more smoothly, use it. For example, most musicians interact 
with their equipment in a systematic way. To keep life simple, you might want 
to explore the possibility of creating a basic mixing témplate file that has all of 
the instruments and general assignments already programmed into it. 



Once you've begun to build your mix, you might want to create a rough mix 
that can be burned to CD. Take it out to the car, take it to a friend's studio, put 
it in your best/worst boom box and/or have a friend critique it. Take notes and 
then revisit the mix after a day. You'll be surprised at what you might find out. 


Need more inputs? 

It's no secret that modern multi-miking and electronic music techniques have 
had a major impact on the physical requirements of mixing hardware and soft¬ 
ware. This is largely due to the increased need for the large number of physical 
inputs, outputs and effects that are commonly encountered in the modern 
project and MIDI production facility. For those who are using a large number 
of hardware instruments and who are doing their production and mixing work 
on an analog or digital hardware mixer, it's easy to understand how we might 
run out of physical inputs when you're faced with synths that have four outputs 
and a drummer that needs 12 mies. With all of these issues and more, you can 
see how a system might easily outgrow your connection needs, leaving you with 
the unpleasant choice of either upgrading or dealing with your present system 
as best you can. Moral of the story: It's always wise to try to anticípate your 
future mixing requirements when buying an audio interface, consolé or mixer. 

One way to keep from running out of inputs on your main mixer or consolé is 
to use an outboard line mixer. These rack-mountable mixers (also known as 
submixers) are often equipped with extra line-level inputs, each having equal- 
ization, pan and effects send capabilities that can be mixed down to either two 
or four channels. These channels can then be used to free up a multitude of 
inputs on your main mixing device. 

For those who prefer to mix (or submix) within a DAW, the problems associated 
with the mixing of virtual instruments basically come down to a need for raw 
computing speed and power. In a virtual environment, any number of software 
instruments can be plugged and played into a workstation in real time, or the 
tracks can be transferred to disk and then mixed into the project as soundfile 
tracks (making sure to save all of the original instrument settings and MIDI 
files). These tracks can continué to add up within a session, but as long as the 
CPU "busy bar" indicator shows that the system is able to keep up with the 
Processing and disk demands, none of this should be a problem. 
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When using a workstation-based system, there are often a number of ways to 
tag on additional analog inputs. For example, a number of interface designs 
offer eight or more analog inputs; however, a number of these Systems offer 
ADAT lightpipe I/O. In short, this means that a number of mic preamp designs 
(usually having eight mic pres) that inelude ADAT lightpipe outputs can be 
plugged into a system, thereby adding eight more analog inputs to the interface. 

A FINAL FOOTNOTE ON THE ART OF MIXING 

Actually, the topic of the art of mixing could easily fill a book (and Im sure it 
has); however, I'd simply like to point out the fact that it is indeed an art form 
... and as such is a very personal process. I remember the first time I sat down 
at a consolé (an older Neve 1604). I was truly petrified and at a loss as to how 
to approach the making of my first mix. Am I overequalizing? Does it sound 
right? Will I ever get used to this sea of knobs? Well, folks, as with all things ... 
the answers come to you when you simply sit down and mix, mix, mix! It's 
always a good idea to watch others in the process of practicing their art and take 
the time to listen to the work of others (both the known and the not-so-well 
known). With practice, it's a foregone conclusión that you'll begin to develop 
your own sense of the art and style of mixing ... which, after all, is what it's all 
about. It's up to you now. Just dive into the deep end ... and have fun! 


This page intentionally left blank 



CHAPTER14 


Signal Processing 


Over the years, signal processing has become an increasingly important part of 
modern audio and music production. It's the function of a signal processor to 
change, augment or otherwise modify an audio signal in either the analog or 
digital domain. This chapter offers some insight into the basics of effects pro¬ 
cessing and how they can be integrated into a recording or mixdown in ways 
that sculpt sound using forms that are subtle, lush or just plain whimsical and 
wacky. 

Of course, the processing power of these effects can be harnessed in either a 
hardware or software plug-in form. Regardless of how you choose to work with 
sound, the important rule to remember is that there are no rules; however, there 
are a few general guidelines that can help you get the sound that you want. 
When using effects, the most important asset you can have is experience and 
your own sense of artistry. The best way to learn the art of processing, shaping 
and augmenting sound is through experience ... and gaining experience takes 
time, a willingness to learn and lots of patience. 

THE WONDERFUL WORLD OF ANALOG, 

DIGITAL OR WHATEVER 

Signal processing devices and their applied practices come in all sizes, shapes 
and flavors. These tools and techniques might be analog, digital or even acous- 
tic in nature (Figure 14.1). The very fact that early analog processors have made 
a serious comeback (in the form of reissued hardware and software plug-in 
emulations) points to the importance of embracing past tools and techniques 
while combining them with the technological advances of the day to make the 
best possible production. 

Although they aren't the first choice that comes to mind, the use of acoustic 
and ambient mic techniques is often the first line of defense when dealing 
with the processing of an audio signal. For example, as we saw in Chapter 4, 
changing a mic or its position might be the best option for changing the 
character of a pickup. Placing a pair of mies out into a room or mixing a guitar 
amp with a second, distant pickup might fill the ambience of an instrument in 
a way that a device just might not be able to duplicate with signal processing. 



The Wonderful World of Analog, Digital or Whatever 


FIGURE 14.1 

Effects rack at 
PatchWerk Recording 
in Atlanta, GA. 
(Courtesy of Russ 
Berger Design Group, 
Inc., www.rbdg.com.) 



In short, never underestimate the power of your acoustic environment as an 
effects tool. 

For those wishing to work in the world of analog, an enormous variety of 
devices can be put to use in a production. Although these generally relate to 
devices that alter a source's relative volume levels (e.g., equalization and dynam- 
ics), there are a number of analog devices that can be used to alter effects that 
are time based. For example, an analog tape machine can be patched so as to 
make an analog delay or regenerative echo device. Although they're not com- 
monly used, spring and píate reverb units that can add their own distinctive 
sound can still be found on the used and sometimes new market. 

On the other hand, the world of digital audio has definitely set signal Process¬ 
ing on fire by offering an almost unlimited range of effects that are available to 
the musician, producer and engineer. One of the biggest advantages to working 
in the digital signal processing (DSP) domain is the fact that software program- 
ming can be used to configure a processor in order to achieve an ever-growing 
range of effects (such as reverb, echo, delay, equalization, dynamics, pitch shift- 
ing or gain changing). 

The task of processing a signal in the digital domain is accomplished by com- 
bining logic circuits in a building-block fashion. These logic blocks follow basic 
binary computational rules that opérate according to a special program algo- 
rithm. When combined, they can be used to alter the numeric valúes of sampled 
audio in a highly predictable way. After a program has been configured (from 
either internal ROM, RAM or system software), complete control over a pro- 
gram's setup parameters can be altered and inserted into a chain as an effected 
digital audio stream. Since the process is fully digital, these settings can be saved 
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and precisely duplicated at any time upon recall. Even more amazing is how 
the overall quality and functionality have steadily increased while at the same 
time becoming more available and cost effective. It has truly brought a huge 
amount of production power to the beginner, as well as to the pro. 

PLUG-INS 

In addition to both analog and digital hardware devices, an ever-growing list of 
signal processors is available for the Mac and PC platforms in the form of soft¬ 
ware plug-ins. These software Utilities offer virtually every processing function 
imaginable (often at a fraction of the price of their hardware counterparts) with 
little or no reduction in quality, capabilities or automation features. These pro- 
grams (which are created and marketed by large and small companies alike) are 
designed to be integrated into an editor or DAW production environment in 
order to perform a particular real-time or non-real-time processing function. 

Currently, several plug-in standards exist, each of which functions as a platform 
that serves as a bridge to connect the plug-in through the computer's operating 
system (OS) to the digital audio production software. This means that any 
plug-in (regardless of its manufacturer) will work with an OS and DAW that's 
compatible with that platform standard, regardless of its form, function and/or 
manufacturer. As of this writing, the most popular standards are VST (PC/Mac), 
DirectX (PC), AudioSuite (Mac), Audio Units (Mac), MAS (MOTU for PC/Mac), 
as well as TDM and RTAS (Digidesign for PC/Mac). 

By and large, effects plug-ins opérate in a native processing environment. This 
means that the computer's host CPU processor carries out the DSP functions. 
With the ever-increasing speed and power of modern-day CPUs, this has become 
less and less of a problem; however, when working on a complex session it's 
still possible for your Computer to run out of DSP steam. This can be dealt with 
in several ways: 

■ Your Computer or processor can be beefed up in order to take full advan- 
tage of your system. 

■ Many DAWs offer a "freeze" or "lock" function that allows a track or pro¬ 
cessing function to be written to disc in a non-real-time fashion in order 
to free up the CPU for other real-time calculations. 

■ A DSP accelerator (Figure 14.2) can be plugged into your Computer to act 
as a coprocessor for sharing the CPU's processing workload. 

Plug-in control and automation 

One of the more fun and powerful aspects of working with various signal pro¬ 
cessing plug-ins on a DAW platform is the ability to control and automate many 
or all of the various effects parameters with relative ease (Figure 14.3). These 
Controls can be manipulated on-screen (via hands-on or track parameter Con¬ 
trols) or from an external hardware controller, allowing the parameters to be 
adjusted, recorded and recalled into a session track in real time. 


Signal Paths in Effects Processing 


FIGURE 14.2 

SSL Duende mini 
DSP accelerator 
plug-in system. 
(Courtesy of Solid 
State Logic, www. 
solid-state-logic. 
com.) 



SIGNAL PATHS IN EFFECTS PROCESSING 

Before delving into the process of effecting and/or altering sound, we should 
first take a quick look at an important signal path concept—the fact that a signal 
Processing device can be inserted into an analog or digital chain in several ways. 
The most common of these are: 

■ Inline routing 

■ Parallel routing. 

Inline routing 

Inline routing is often used to alter a signal. It occurs whenever a processor is 
inserted directly into a signal path in a serial fashion. Using this approach, the 
path passes from an audio source, through the signal processor and directly out 
to another device or point in the chain. This method for inserting a device is 
generally used for the processing of a single instrument, voice or grouped signáis 
that are present on a particular line. Often, but not always, this device tends to 
be a level-based processor (such as an equalizer, compressor or limiter). In 
keeping with the "no-rules" concept, time- and pitch-changing devices can also 
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be used to tweak an instrument or voice in the signal chain. Here 
are but a few examples of how inline routing can be used: 

■ A device can be plugged into an input strip's inserí (direct 
send/return) point. This approach is often used to inserí 
an outboard device directly into an input strip's signal path 
(Figure 14.4). 

■ A console's main output bus could be run through a device 
(such as a stereo compressor) to affect an overall mix or 
submix grouping. 

■ An effects stomp box could be placed between a mic 
preamp and consolé input to create a grungy distortion 
effect. 

■ A DAW plug-in could be inserted into an input path to 
process only the signal on that channel (Figure 14.5). 



EXTERNAL CONTROL OVER AN INLINE EFFECTS SIGNAL PATH 

Certain in-line effects processors allow for an external signal source to act as a 
control for affecting a signal as it passes from the input to the output of a device 
(Figure 14.6). Devices that offer an external "key" input can be quite useful, 


FIGURE 14.3 

Novation’s Nocturn 
plug-in controller 
with an automap 
function that 
automatically maps 
the Controls to 
various parameters 
on the target plug-in. 


from channel preamp 


Processing device 



to channel chain 


FIGURE 14.4 

An example of inline 
routing, whereby the 
processed signal is 
inserted directly into 
the path of an input 
strip. 
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FIGURE 14.5 

Example of how an 
inline effect can be 
inserted into a single 
channel on a DAW 
track. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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FIGURE 14.6 

Diagram of a key side-chain input to a noise gate. (a) The signal is passed whenever a signal is present at the key 
input. (b) No signal is passed when no signal is present at the key input. 
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allowing a signal source to be used as a control for varying another audio path. 
For example: 

■ A gate (an infinite expander that can control the passing of audio through 
a gain device) might take its control input from an external "key" signal 
that will determine when a signal will or will not pass in order to reduce 
leakage. 

■ A vocal track could be inserted into a vocorder's control input, so as to 
synthetically add a robot-like effect to a track. 

■ A voice track could be used for vocal ducking at a radio station, to fade 
out music when a narrator is speaking. 

■ An external keyed input can be used to make a mix "pump" or "breathe" 
in a dance production. 

It's important to note that there is no set standard for providing a side-chain 
key in software. Some software packages provide native side-chain capability, 
while some support side-chaining via "múltiple input" plug-ins and complex 
signal routing and others don't support side-chaining at all. 

Parallel routing 

Parallel routing (sometimes referred to as "side-chain" routing) is often used to 
augment a signal (generally being used to add reverb, delay or other time-based 
effects). It occurs whenever a portion of the original signal is allowed to pass 
through the chain while a side signal is simultaneously fed to an effects device. 
... Once effected, the signal is then proportionately mixed back in with the 
original signal to create an effects blend. This simple form of side-chain routing 
can happen in either of two ways: 

■ A signal source (or mixed grouping of effects send sources) can be sent to 
an effects device, which is then mixed back in with the original source at 
a consolé input or effects return bus (Figures 14.7 and 14.8). 

■ A signal source can be sent to an effects device that has an internal "Mix" 
control, which serves as a side-chain mix control for varying the amount 
of "dry" (original) signal to be mixed with the "wet" (effected) signal, as 
seen in Figure 14.9. 


EFFECT PROCESSING 

From this point on, this chapter will be taking an in-depth look at many of the 
signal processing devices, applications and techniques that have traditionally 
been the cornerstone of music and sound production, including systems and 
techniques that exert an ever-increasing degree of control over: 



Send 1 


FIGURE 14.7 

An “aux” sends path 
flows in a horizontal 
fashion to a send 
bus. The combined 
sum can then be 
effected (or sent to 
another destination) 
or returned back to 
the main output bus. 
(Courtesy of Loud 
Technologies Inc., 
www.mackie.com.) 


Effect Processing 


FIGURE 14.8 

An effects plug-in 
can be inserted into 
an effects send bus, 
allowing múltiple 
channels to be 
assigned to a single 
effects processor. 
(Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com, and Universal 
Audio, www.uaudio. 
com. © 2009 
Universal Audio, 

Inc. All rights 
reserved. Used with 
permission.) 


FIGURE 14.9 

Example of a 
commonly found mix 
control that allows an 
inline device to work 
in a parallel fashion. 



a group send can route 
múltiple sources to a 
single effects processor 




dry/wet mix control 


■ The spectral content of a sound: in the form of equalization or intelligent 
equalization and bandpass filtering 

■ Amplitude level Processing: in the form of dynamic range processing 

■ Time-based effects: Augmentation or re-creation of room ambience, delay, 
time/pitch alterations and tons of other special effects that can range from 
being sublimely subtle to "in yo' face." 

Hardware and virtual effects in action 

The following sections offer some insight into the basics of effects processing 
and how they can be integrated into a recording or mixdown. It's a forgone 
conclusión that the power of these effects can be harnessed in hardware or 
software plug-in form. The important rule to remember is that there are no 
rules; however, there are a few general guidelines that can help you get the sound 
that you want. When using effects, the most important asset you can have is 
experience and your own sense of artistry. The best way to learn the art of pro¬ 
cessing, shaping and augmenting sound is through experience; gaining experi¬ 
ence takes time, a willingness to learn and lots of patience. 

As stated in Chapter 13, an audio equalizer (Figure 14.10 and 14.11) is a Circuit, 
device or plug-in that lets us exert control over the harmonic or timbral content 
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FIGURE 14.10 

Manley Massive 
Passive Stereo 
Equalizer (Courtesy 
of Manley 
Laboratories, Inc., 
www.manleylabs. 
com.) 
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FIGURE 14.11 

EQ plug-ins: (a) Pro 
Tools 4-band 
EQII (courtesy of 
Digidesign, a 
división of Avid 
Technology, Inc., 
www.digidesign. 
com); (b) Cubase/ 
Nuendo (courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


of a recorded sound. EQ may need to be applied to a single recorded channel, 
to a group of channels or to an entire program. 

Equalization refers to the alteration in frequency response of an amplifier so 
that the relative levels of certain frequencies are more or less pronounced than 
others. EQ is specified as either plus or minus a certain number of decibels at 
a certain frequency. For example, you might want to boost a signal by "+4 dB 
at 5 kHz." Although only one frequency was specified in this example, in reality 
a range of frequencies above, below and centered around the specified frequency 
will often be affected. The amount of boost or cut at frequencies other than the 
one named is determined by whether the curve is peaking or shelving, by the 
bandwidth of the curve (a factor that's affected by the Q settings and determines 
how many frequencies will be affected around a chosen centerline), and by 
the amount of boost or cut at the named frequency. For example, a +4-dB 
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FIGURE 14.11 

Continued 
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boost at 1000 Hz might easily add a degree of boost or cut at 800 and 1200 Hz 
(Figure 14.12). 

Older equalizers and newer "retro" systems often base their design around filters 
that use passive components (i.e., inductors, capacitors and resistors) and 
employ amplifiers only to make up for internal losses in level, called insertion 
loss. Most equalization circuits today, however, are of the active filter type that 
change their characteristics by altering the feedback loop of an operational amp. 
This is by far the most common analog EQ type and is generally favored over 
its passive counterpart due to its low cost, size and weight, as well as its wide 
gain range and line-driving capabilities. 

Peaking filters 

The most common EQ curve is created by a peaking filter. As its ñame implies, 
a peak-shaped bell curve can either be boosted or cut around a selected center 
frequency. Figure 14.13 shows the curves for a peak equalizer that's set to boost 
or cut at 1000 Hz. The quality factor (Q) of a peaking equalizer refers to the 
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FIGURE 14.12 

Various boost/cut EQ 
curves centered 
around 1 kHz: 

(a) center frequency, 
1-kHz bandwidth 
1 octave, ±15-dB 
boost/cut; (b) center 
frequency, 1-kHz 
bandwidth 3 octaves, 
±15-dB boost/cut. 
(Courtesy of Mackie 
Designs, www. 
mackie.com.) 


width of its bell-shaped curve. A curve with a high Q will have a narrow band¬ 
width with few frequencies outside the selected bandwidth being affected, 
whereas a curve having a low Q is very broadband and can affect many frequen¬ 
cies (or even octaves) around the center frequency. Bandwidth is a measure of 
the range of frequencies that lie between the upper and lower -3-dB (half- 
power) points on the curve (Figure 14.14). The Q of a filter is an inverse measure 
of the bandwidth (such that higher Q valúes mean that fewer frequencies will 
be affected, and vice versa). To calcúlate Q, simply divide the center frequency 
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FIGURE 14.13 

Peaking equalization 
curves. 




Frequency 


FIGURE 14.14 

The number of hertz 
between the two 
points that are 3 dB 
down from the center 
frequency determines 
the bandwidth of a 
peaking filter. 


by the bandwidth. For example, a filter centered at 1 kHz that's 
a third of an octave wide will have its -3-dB frequency points 
located at 891 and 1123 Hz, yielding a bandwidth of 232 Hz 
(1123 - 891). This EQ curve's Q, therefore, will be 1 kHz 
divided by 232 Hz or 4.31. 

Shelving filters 

Another type of equalizer is the shelving filter. Shelving refers 
to a rise or drop in frequency response at a selected frequency, 
which tapers off to a preset level and continúes at that level to 
the end of the audio spectrum. Shelving can be inserted at either the high or 
low end of the audio range and is the curve type that's commonly found on 
home stereo bass and treble Controls (Figure 14.15). 

High-pass and low-pass filters 


Equalizer types also inelude high-pass and low-pass filters. As their ñames imply, 
this EQ type allows certain frequency bandwidths to be passed at full level while 
other sections of the audible spectrum are attenuated. Frequencies that are 
attenuated by less than 3 dB are said to be inside the passband ; those attenuated 
by more than 3 dB are in the stopband. The frequency at which the signal is 
attenuated by exactly 3 dB is called the turnover or cutoff frequency and is used 
to ñame the filter frequency. 


Ideally, attenuation would become infinite immediately outside the passband; 
however, in practice this isn't always attainable. Commonly, attenuation is 
carried out at rates of 6, 12 and 18 dB per octave. This rate is called the slope of 
the filter. Figure 14.16a, for example, shows a 700-Hz high-pass filter response 
curve with a slope of 6 dB per octave, and Figure 14.16b shows a 700-Hz low- 
pass filter response curve having a slope of 12 dB per octave. High- and low-pass 
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FIGURE 14.15 

High/low, boost/cut 
curves of a shelving 
equalizer. 




FIGURE 14.16 

A 700-Hz filter: (a) high-pass filter with a slope of 6 dB per octave; (b) low-pass filter with a slope of 12 dB per octave. 


filters differ from shelving EQ in that their attenuation doesn't level off outside 
the passband. Instead, the cutoff attenuation continúes to increase. A high-pass 
filter in combination with a low-pass filter can be used to create a bandpass filter, 
with the passband being controlled by their respective turnover frequencies and 
the Q by the filter's slope (Figure 14.17). 

Equalizer types 

The four most commonly used equalizer types that can incorpórate one or more 
of the previously described filter types are the: 

■ Parametric equalizer 

■ Selectable frequency equalizer 

■ Graphic equalizer 

■ Notch filter. 
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FIGURE 14.17 

A bandpass filter is 
created by combining 
a high- and low-pass 
filter with different 
cutoff frequencies. 


FIGURE 14.18 

The EQF-100 full 
range, parametric 
vacuum tube 
equalizer. (Courtesy 
of Summit Audio, 
Inc., www. 
summitaudio.com.) 




The parametric equalizer (Figure 14.18) lets you adjust most or all of its frequency 
parameters in a continuously variable fashion. Although the basic design layout 
will change from model to model, each band will often have an adjustment for 
continuously varying the center frequency. The amount of boost or cut is also 
continuously variable. Control over the center frequency and Q can be either 
selectable or continuously variable, although certain manufacturers might not 
have provisions for a variable Q. 

Generally, each set of frequency bands will overlap into the next band section, 
so as to provide smooth transitions between frequency bands or allow for múl¬ 
tiple curves to be placed in nearby frequency ranges. Because of its flexibility 
and performance, the parametric equalizer has become the standard design for 
most input strips, digital equalizers and workstations. 

The selectable frequency equalizer (Figure 14.19), as its ñame implies, has a set 
number of frequencies from which to choose. These equalizers usually allow a 
boost or cut to be performed at a number of selected frequencies with a prede- 
termined Q. They are most often found on older consolé designs, certain low- 
cost production consoles and outboard gear. 

A graphic equalizer (Figure 14.20) provides boost and cut level control over a 
series of center frequencies that are equally spaced (ideally according to music 
intervals). An "octave band" graphic equalizer might, for example, have 12 
equalization Controls spaced at the octave intervals of 20, 40, 80, 160, 320 and 
640 Hz and 1.25, 2.5, 5, 10 and 20 kHz, while 1/3-octave equalizers could have 
up to 36 center frequency Controls. The various EQ band Controls generally use 
vertical sliders that are arranged side by side so that the physical positions of 
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FIGURE 14.19 

The API 550A 
selectable frequency 
equalizer within the 
7600 outboard 
module. (Courtesy of 
API Audio, www. 
apiaudio.com.) 



FIGURE 14.20 

Rane GE 130 
single-channel, 
30-band, 1/3-octave 
graphic equalizer. 
(Courtesy of Rane 
Corporation, www. 
rane.com.) 


these Controls could provide a "graphic" readout of the overall frequency 
response curve at a glance. This type is often used in applications that can help 
fine-tune a system to compénsate for the acoustics in various types of rooms, 
auditoriums and studio control rooms. 

Notch filters are often used to zero in on and remove 60- or 50-Hz hum 
or other undesirable discrete-frequency noises. They use a very narrow band- 
width to fine-tune and attenuate a particular frequency in such a way as to have 
little effect on the rest of the audio program. Notch filters are used more in film 
location sound and broadcast than in studio recording, because 
severe narrow-band problems aren't often encountered 
in a well-designed studio ... hopefully. 


Applying equalization 

When you get right down to it, EQ is all about 
compensating for deficiencies in a sound 
pickup or about reducing extraneous sounds 
that make their way into a pickup signal. To start 
our discussion on how to apply EQ, let's again 
revisit the all-important "Good Rule" from Chapter 4. 


THE "GOOD RULE" 

Good musician + good instrument 
+ good performance + good acoustics 
+ good mic + good placement 
= good sound. 


Whenever possible, EQ should not be used as a Band-Aid. By this, I mean that 
it's often a good idea to correct for a problem on the spot rather than to rely 
on the hope that you can "fix it in the mix" at a later time using EQ and other 
methods. 
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When in doubt, it's often better to deal with a problem as it occurs. This 
isn't always possible, however ... therefore, EQ is best used in situations 
where: 

■ There's no time or money left to redo the track. 

■ The existing take was simply magical and shouldn't be re-recorded. 

■ The track was already recorded during a previous session. 

EQ in action! 

Although most equalization is done by ear, it's helpful to have a sense of 
which frequencies affect an instrument in order to achieve a particular effect. 
On the whole, the audio spectrum can be divided into four frequency bands: 
low (20 to 200 Hz), low-mid (200 to 1000 Hz), high-mid (1000 to 5000 Hz) 
and high (5000 to 20,000 Hz). When the frequencies in the 20- to 200-Hz 
(low) range are modified, the fundamental and the lower harmonic range 
of most bass information will be affected. These sounds often are felt as 
well as heard, so boosting in this range can add a greater sense of power or 
punch to music. Lowering this range will weaken or thin out the lower frequency 
range. 

The fundamental notes of most instruments lie within the 200- to 1000-Hz 
(low-mid) range. Changes in this range often result in dramatic variations in 
the signal's overall energy and add to the overall impact of a program. Because 
of the ear's sensitivity in this range, a minor change can result in an effect that's 
very audible. The frequencies around 200 Hz can add a greater feeling of warmth 
to the bass without loss of definition. Frequencies in the 500- to 1000-Hz range 
could make an instrument sound hornlike, while too much boost in this range 
can cause listening fatigue. 

Higher-pitched instruments are most often affected in the 1000- to 5000-Hz 
(high-mid) range. Boosting these frequencies often results in an added sense of 
clarity, definition and brightness. Too much boost in the 1000- to 2000-Hz 
range can have a "tinny" effect on the overall sound, while the upper mid-fre- 
quency range (2000 to 4000 Hz) affects the intelligibility of speech. Boosting 
in this range can make music seem closer to the listener, but too much of a 
boost often tends to cause listening fatigue. 

The 5000- to 20,000-Hz (high-frequency) región is composed almost entirely 
of instrument harmonics. For example, boosting frequencies in this range will 
often add sparkle and brilliance to a string or woodwind instrument. Boosting 
too much might produce sibilance on vocals and make the upper range of 
certain percussion instruments sound harsh and brittle. Boosting at around 
5000 Hz has the effect of making music sound louder. A 6-dB boost at 5000 Hz, 
for example, can sometimes make the overall program level sound as though 
it's been doubled in level; conversely, attenuation can make music seem more 
distant. Table 14.1 provides an analysis of how frequencies and EQ settings can 
interact with various instruments. (For more information, refer to the Micro- 
phone Placement Techniques section in Chapter 4.) 
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Table 14.1 

Instrumental Frequency Ranges of Interest 

Instrument 

Frequencies of Interest 

Kick drum 

Bottom depth at 60-80 Hz, slap attack at 2.5 kHz 

Snare drum 

Fatness at 240 Hz, crispness at 5 kHz 

Hi-hat/cymbals 

Clank or gong sound at 200 Hz, shimmer at 7.5 kHz to 
12 kHz 

Rack toms 

Fullness at 240 Hz, attack at 5 kHz 

Floor toms 

Fullness at 80-120 Hz, attack at 5 kHz 

Bass guitar 

Bottom at 60-80 Hz, attack/pluck at 700-1000 Hz, 
string noise/pop at 2.5 kHz 

Electric guitar 

Fullness at 240 Hz, bite at 2.5 kHz 

Acoustic guitar 

Bottom at 80-120 Hz, body at 240 Hz, clarity at 

2.5-5 kHz 

Electric organ 

Bottom at 80-120 Hz, body at 240 Hz, presence at 

2.5 kHz 

Acoustic piano 

Bottom at 80-120 Hz, presence at 2.5-5 kHz, crisp 
attack at 10 kHz, honky-tonk sound (sharp Q) at 2.5 kHz 

Horns 

Fullness at 120-240 Hz, shrill at 5-7.5 kHz 

Strings 

Fullness at 240 Hz, scratchiness at 7.5-10 kHz 

Conga/bongo 

Resonance at 200-240 Hz, presence/slap at 5 kHz 

Vocals 

Fullness at 120 Hz, boominess at 200-240 Hz, 
presence at 5 kHz, sibilance at 7.5-10 kHz 


Note: These frequencies aren’t absolute for all ¡nstruments, but are meant as a 
subjective guide. 


01Y 1 

do it yourself 

Tutorial: Equalization 


1. Solo an ¡nput strip on a mixer, consolé or DAW. 
Experiment with the settings using the previous 
frequency ranges. Can you improve on the original 
recorded track or does it take away from the 
sound? 

2. Using the input strip equalizers on a mixer, consolé or 
DAW, experiment with the EQ settings and relative 
instrument levels of an entire mix using the previous 


frequency ranges as a guide. Can you bring an 
instrument out without changing the fader gains? Can 
you alter the settings of two or more ¡nstruments to 
increase the mix’s overall clarity? 

3. Plug the main output buses of a mixer, consolé or 
DAW into an outboard equalizer, and change the 
program’s EQ settings using the previous frequency 
range discussions as a guide. 
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One way to zero in on a particular frequency using an equalizer (especially a 
parametric one) is to accentuate or attenuate the EQ level and then vary the 
center frequency until the desired range is found. The level should then be 
scaled back until the desired effect is obtained. If boosting in one-instrument 
range causes you to want to do the same in other frequency ranges, it's likely 
that you're simply raising the program's overall level. It's easy to get caught up 
in the "bigger! better! more!" syndrome of wanting an instrument to sound 
louder. If this continúes to happen on a mix, it's likely that one of the frequency 
ranges of an instrument or ensemble is too dominant and requires attenuation. 
On the subject of laying down a recorded track with EQ, there are a number of 
situations and differing opinions regarding them: 

■ Some use EQ liberally to make up for placement and mic deficiencies, 
whereas others might use it sparingly, if at all. One example where EQ is 
used sparingly is when an engineer knows that someone else will be 
mixing a particular song or project. In this situation, the engineer who's 
doing the mix might have a very different idea of how an instrument 
should sound. If large amounts of EQ were recorded to a track during the 
session, the mix engineer might have to work very hard to counteract the 
original EQ settings. 

■ If everything was recorded fíat, the producer and artists might have diffi- 
culty passing judgment on a performance or hearing the proper balance 
during the overdub phase. Such a situation might cali for equalization in 
the monitor mix, while leaving the recorded tracks alone. 

■ In situations where several mies are to be combined onto a single 
track or channel, the mies can be individually equalized only during 
the recording phase. In situations where a project is to be engineered, 
mixed and possibly even mastered, the engineer might want to know 
in advance the type and amount of EQ that the producer and/or artist 
would want. 

■ Above all, it's wise that any "sound-shaping" should be determined and 
discussed with the producer and/or artist befo re the sounds are committed 
to a track. 

Whether you choose to use EQ sparingly, as a right-hand tool for correcting 
deficiencies or not at all, there's no getting around the fact that an equalizer is 
a powerful tool. When used properly, it can greatly enhance or restore the 
musical and sonic balance of a signal. Experimentation and experience are the 
keys to proper EQ usage, and no book can replace the trial-and-error process of 
"just doing it!" 

Before moving on, it's important to keep one age-old viewpoint in mind—that 
an equalizer shouldn't be regarded as a cure-all for improper mic technique; 
rather, it should be used as a tool for correcting problems that can't be easily 
fixed on the spot through mic and/or performance adjustments. If an instru¬ 
ment is poorly recorded during an initial recording session, it's often far more 
difficult and time consuming to "fix it in the mix" at a later time. Getting the 
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best possible sound down onto tape will definitely improve your chances for 
attaining a sound and overall mix that you can be proud of. 


DYNAMIC RANGE 

Like most things in Ufe that get out of hand from time to time, the level of a 
signal can vary widely from one moment to the next. For example, if a vocalist 
gets caught up in the moment and lets out an impassioned scream following a 
soft whispery passage, you can almost guarantee that the mic's signal will jump 
from its optimum recording level into severe distortion ... OUCH! Conversely, 
if you set an instrument's mic to accommodate the loudest level, its signal might 
be buried in the mix during the rest of the song. For these and other reasons, 
it becomes obvious that it's sometimes necessary to exert some form of control 
over a signal's dynamic range by using various techniques and dynamic control- 
ling devices. In short, the dynamics of an audio program's signal resides some- 
where in a continuously varying realm between three level States: 

■ Saturation 

■ Average signal level 

■ System/ambient noise. 

As you may remember from various chapters in this book, saturation occurs 
when an input signal is so large that an amp's supply voltage isn't large enough 
to produce the required output current or is so large that a digital converter 
reaches full scale (where the A/D output reads as all l's). In either case, the 
results generally don't sound pretty and should be avoided in the studio's audio 
chain. The average signal level is where the overall signal level of a mix resides. 
Logically, if an instrument's level is too low, it can get buried in the mix ... if 
it's too high, it can unnecessarily stick out and throw the entire balance off. It 
is here that the art of mixing at an average level that's high enough to stand out 
in the sonic crowd, while still retaining enough dynamic "life," truly becomes 
an applied balance of skill and magic. The following sections on readout dis- 
plays and dynamic range processors can be used in the recording process to 
tame the audio signal in such a way that instruments, vocals, and the like can 
be placed squarely in the sonic pocket, at an optimum signal level. 

Metering 

Amplifiers, magnetic tape and even digital media are limited in the range of 
signal levels that they can pass without distortion. As a result, audio engineers 
need a basic standard to help determine whether the signáis they're working 
with will be stored or transmitted without distortion. The most convenient way 
to do this is to use a visual level display, such as a meter. Two types of metering 
ballistics (active response times) are encountered in recording sound to either 
analog or digital media: 

■ Average (rms) 

■ Peak. 


Dynamic Range 


From Chapter 2, we know that the root-mean-square (rms) valué was developed 
to determine a meaningful average level of a waveform over time. Since humans 
perceive loudness according to a signal's average valué (in a way that doesn't 
bear much relationship to a signal's instantaneous peak level), the displays of 
many meters will indicate an average signal-level readout. The total amplitude 
measurement of the positive and negative peak signal levels is called the peak- 
to-peak valué. A readout that measures the máximum amplitude fluctuations of 
a waveform is a peak-indicating meter. 

One could definitely argüe that both average and peak readout levels have their 
own set of advantages. For example, the ear's perception of loudness is largely 
proportional to the rms (average) valué of a signal, not its peak valué. On the 
other hand, a peak readout displays the actual amplitude at a particular point 
in time and not the overall perceived level. For this reason, a peak meter might 
show readings that are noticeably higher at a particular point in the program 
than the averaged rms counterpart (Figure 14.21). Such a reading will alertyou 
to the fact that the short-term peaks are at levels that are above the clipping 
point, while the average signal is below the máximum limits. Llnder such con- 
ditions (where short-duration peaks are above the distortion limit), you might 
or might not hear distortion as it often depends on the makeup of the signal 
that's being recorded; for example, the clipped peaks of a bass guitar will not 
be nearly as noticeable as the clipped high-end peaks of a cymbal. The record- 
ing médium often plays a part in how a meter display will relate to sonic reality; 
for example, recording a signal with clipped peaks onto a tube analog tape 
machine might be barely noticeable (because the tubes and the tape médium 
act to smooth over these distortions), whereas a DAW or digital recorder might 
churn out hash that's as ugly as the night (or your current session) is long. 

Getting to the heart of the matter, it goes without saying that, whenever the 
signal is too high (hot), it's an indication for you to grab hold of the channel's 
mic trim, output fader or whatever level control is the culprit and turn it down. 
In doing so, you've actually become a dynamic range-changing device. In fact, 
the main channel fader (which can be controlling an input level during 


FIGURE 14.21 

A peak meter reads 
higher at point A 
than at point B, even 
though the average 
loudness level is the 
same. 
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recording or a tape track's level during mixdown) is by far the most intuitive 
and most often used dynamic gain-changing devi ce in the studio. 

In practice, the difference between the máximum level that can be handled 
without incurring distortion and the average operating level of the system is 
called headroom. Some studio-quality preamplifiers are cap able of signal outputs 
as high as 26 dB above 0 VU and thus are said to have 26 dB of headroom. 
With regard to analog tape, the 3% distortion level for analog magnetic tape is 
typically only 8 dB above 0 VU. For this reason, the best recording level for most 
program material is around 0 VU (although higher levels are possible provided 
that short-term peak levels aren't excessively high). In some circumstances (i.e., 
when using higher bias, low-noise/high-output analog tape), it's actually pos¬ 
sible to record at higher levels without distortion, because the analog tape 
formulation is cap able of handling higher magnetic flux levels. With regard to 
digital media, the guidelines are often far less precise and will often depend on 
your currently chosen bit rate. Since a higher bit rate (e.g., 24 or 32 bits) directly 
translates into a wider dynamic range, it's often a good idea to back off from 
the máximum level—because noise generally isn't a problem at these bit rates. 


In fact, the optimal average recorded level for a 
44.1 -k/16-bit recording ¡s often agreed to be around 
-12 dB, so as to avoid peak clipping. When recording at 
higher word lengths, the encoded dynamic range is often 
so wide that the average level could easily be further 


reduced by 10 or more decibels. Reducing the recorded 
levels beyond this could have the unintended side effect 
of unnecessarily increasing distortion due to quantization 
noise (although this usually isn’t an overriding factor at 
higher bit rates). 


... Now that we've gotten a few of the basic concepts out of the way, let's take 
a brief look at two of the most common meter readout displays. 

THE VU METER 

The traditional signal-level indicator for analog equipment is the VU meter 
(Figure 14.22). The scale chosen for this device is calibrated in volume units 
(henee its ñame) and is designed to display a signal's average rms level over 
time. The standard operating level for most consoles, mixers and analog tape 
machines is considered to be 0 VU. Although VU meters do the job of indicat- 
ing rms volume levels, they ignore the short-term peaks that can overload a 
track. This means that the professional consolé Systems must often be designed 
so that unacceptable distortion doesn't occur until at least 14 dB above 0 VU. 
Typical VU meter specifications are listed in Table 14.2. 

Since recording is an art form, I have to rise to the defense of those who prefer 
to record certain instruments (particularly drums and percussion) at levels that 
bounce or even "pin" VU needles at higher levels than 0 VU. When recording 


Dynamic Range 


FIGURE 14.22 

A VU meter’s upper 
scale is calibrated in 
volume units (used 
for recording), while 
the lower scale is 
measured in 
percentage of 
modulation (often 
used in broadcast). 



Table 14.2 VU Meter Specifications 

Characteristic 

Specification 

Sensitivity 

Reads 0 VU when fed a +4-dBm signa! (1.228 V 
into a 600-Q Circuit) 

Frequency response 

±0.2 dB from 35 Hz to 10 kHz; ±0.5 dB from 

25 Hz to 16 kHz 

Overload capability 

Can withstand 5 times 0-VU level (approximately 
+18 dBm) continuously and 10 times 0-VU level 
(+24 dBm) for 0.5 sec 


to a professional analog machine, this can actually give a track a "gutsy" feel 
that can add impact to a performance. This is rarely a good idea when recording 
instruments that contain high-frequency/high-level signáis (such as a snare or 
cymbals), because the peak transients will probably distort in a way that's hardly 
pleasing ... and it's almost NEVER a good idea when recording to a digital 
system. Always be aware that you can often add distortion to a track at a later 
time (using any number of ingenious tricks), but you can't remove it from an 
existing track. As always, it's wise to talk such moves over with the producer and 
artist beforehand. 

THE AVERAGING/PEAK METER 

While many analog devices display levels using the traditional VU meter, most 
digital hardware and software devices will often display a combination of VU 
and peak program-level metering, using an LCD or on-screen readout. This 
best-of-both-worlds system makes sense in the digital world as it gives us a 
traditional readout that visually corresponds to what our ears are hearing, while 
providing a quick-and-easy display of the peak levels at any point in time. Often, 
the peak readout is frozen in position for a few seconds before resetting to a 
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new level (making it easier to spot the máximum levels), or it's permanently 
held at that level until a higher level comes along to bump it up. Of course, 
should a peak level approach the clipping level, a red "clip" indicator will often 
light, showing that it's time to back off the levels. 

The idea of "pinning" the input levels of any digital recording device is a definite 
no-no. The dreaded "clip" indicator of a digital meter means thatyou've reached 
the saturation point, with no headroom to spare. Unfortunately, digital stan¬ 
dard operating levels are far more ambiguous than their analog counterparts, 
and you should consult the particular device manual for level recommendations 
(if it says anything about the subject at all). In light of this, I actually encourage 
you to slightly pin the meters on several digital devices (make sure the monitors 
aren't turned up too far) just to find out how obnoxious even the smallest 
amount of clipping can sound ... and how those levels might vary from device 
to device. Pretty harsh, huh? 

Dynamic range processors 

The overall dynamic range of music is potentially on the order of 120 to 140 dB, 
whereas the overall dynamic range of a compact disc is often 80 to 90 dB, and 
analog magnetic tape is on the order of 60 dB (excluding the use of noise- 
reduction systems, which can improve this figure by 15 to 30 dB). However, 
when working with 20- and 24-bit digital word lengths, a system, processor or 
channel's overall dynamic range can actually approach or exceed the full range 
of hearing. Even with such a wide dynamic range, unless the recorded program 
is played back in a noise-free environment, either the quiet passages will get 
lost in the ambient noise of the listening area (35 to 45 dB SPL for the average 
home and much worse in a car) or the loud passages will simply be too loud 
to bear. Similarly, if a program of wide dynamic range were to be played through 
a médium with a limited dynamic range (such as the 20- to 30-dB range of an 
AM radio or the 40- to 50-dB range of FM), a great deal of information would 
get lost in the background noise. To prevent these problems, the dynamics of a 
program can be restricted to a level that's appropriate for the reproduction 
médium (theater, radio, home system, car, etc.) as shown in Figure 14.23. This 
gain reduction can be accomplished either by manually riding the fader's gain 
or through the use of a dynamic range processor that can alter the range between 
the signal's softest and loudest passages. 

The concept of automatically changing the gain of an audio signal (through the 
use of compression, limiting and/or expansión) is perhaps one of the most 
misunderstood aspects of audio recording. This can be partially attributed to 
the fact that a well-done job won't be overly obvious to the listener. Changing 
the dynamics of a track or overall program will often affect the way in which it 
will be perceived (either unconsciously or consciously) by making it "seem" 
louder, by reducing its volume range to better suit a particular médium or by 
making it possible for a particular sound to ride at a better level above other 
tracks within a mix. 
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FIGURE 14.23 

Dynamic ranges of 
various audio media, 
showing the noise 
floor (black), average 
level (white) and 
peak levels (gray). 
(Courtesy of Thomas 
Lund, te electronic, 
www.tcelectronic. 
com.) 


FIGURE 14.24 

Universal Audio 
1176LN limiting 
amplifier. (Courtesy 
of Universal Audio, 
www.uaudio.com. 

© 2009 Universal 
Audio, Inc. All rights 
reserved. Used 
with permission.) 
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Compression 

A compressor (Figure 14.24), in effect, can be thought of as an automatic fader. 
It is used to proportionately reduce the dynamics of a signal that rises above a 
user-definable level (known as the threshold ) to a lesser volume range. This 
process is done so that: 

■ The dynamics can be managed by the electronics and/or amplifiers in the 
signal chain. 

■ The range is appropriate to the overall dynamics of a playback or broadeast 
médium. 

■ An instrument better matches the dynamics of other recorded tracks 
within a song or audio program. 

Since the signáis of a track, group or program will be automatically turned down 
(henee the terms compressed or squashed ) during a loud passage, the overall level 
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of the newly reduced signal can now be amplified. In other words, once the 
dynamics have been reduced downward, the overall level can be boosted such 
that the range between the loud and soft levels is less pronounced (Figure 
14.25). We've not only restored the louder signáis back to a prominent level but 
we have also turned up the softer signáis. In effect, we've turned up the softer 
signáis that would otherwise be buried in the mix or ambient background noise. 

The most common Controls on a compressor (and most other dynamic range 
devices) inelude input gain, threshold, output gain, slope ratio, attack, release 
and meter display: 


■ Input gain: This control is used to determine how much signal will be sent 
to the compressor's input stage. 

■ Threshold: This setting determines the level at which the compressor will 
begin to proportionately reduce the incoming signal. For example, if the 
threshold is set to -20 dB, all signáis that fall below this level will be 
unaffected, while signáis above this level will be proportionately attenu- 
ated, thereby reducing the overall dynamics. On some devices, varying the 
input gain will correspondingly control the threshold level. In this situa- 
tion, raising the input level will lower the threshold point and thus reduce 
the overall dynamic range. Most quality compressors offer hard and soft 
knee threshold options. A soft knee widens or broadens the threshold 
range, making the onset of compression less obtrusive, while the hard knee 
setting causes the effect to kick in quickly above the threshold point. 

■ Output gain: This control is used to determine how much signal will be 
sent to the device's output. It's used to boost the reduced dynamic signal 
into a range where it can best match the level of a médium or be better 
heard in a mix. 

■ Slope ratio: This control determines the slope of the input-to-output gain 
ratio. In simpler terms, it determines the amount of input signal (in deci- 
bels) that's needed to cause a 1-dB increase at the compressor's output 
(Figure 14.26). For example, with a ratio of 4:1, for every 4-dB increase 
at the input there will be only a 1-dB increase at the output; an 8-dB input 
increase will raise the output by 2 dB, while a ratio of 2:1 will produce a 
1-dB increase in output for every 2-dB increase at its input. Get the idea? 

■ Attack: This setting (which is calibrated in milliseconds; 1 msec = 1 thou- 
sandth of a second) determines how fast or how slowly the device will 



2:1 compression / 30 dB gain 
(signáis abo ve [he threshold 
are proportionately turned down) 


threshold (set at 30 dB) 


noise floor 




FIGURE 14.25 

A compressor 
reduces input levels 
that exceed a 
selected threshold by 
a specified amount. 
Once reduced, the 
overall signal can 
then be boosted 
in level, thereby 
allowing the softer 
signáis to be raised 
above other program 
or background 
sounds. 
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FIGURE 14.26 

The output of a 
compressor is linear 
below the threshold 
and follows an input/ 
output gain reduction 
ratio above this point, 
while the signal is 
proportionately 
reduced for signáis 
that fall above the 
threshold level. 



turn down signáis that exceed the threshold. It is defined as the time it 
takes for the gain to decrease to a percentage (usually 63%) of its final 
gain valué. In certain situations (as might occur with instruments that have 
a long sustain, such as the bass guitar), setting a compressor to instantly 
turn down a signal might be audible (possibly creating a sound that 
pumps the signal's dynamics). In this situation, it would be best to use a 
slower attack setting. On the other hand, such a setting might not give the 
compressor time to react to sharp, transient sounds (such as a hi-hat). In 
this case, a fast attack time would probably work better. As you might 
expect, you'll need to experiment to arrive at the fastest attack setting that 
won't audibly color the signal's sound. 

■ Release : Similar to the attack setting, release (which is calibrated in mil- 
liseconds) is used to determine how slowly or quickly the device will 
restore a signal to its original dynamic level once it has fallen below the 
threshold point (defined as the time required for the gain to return to 
63% of its original valué). Too fast a setting will cause the compressor to 
change dynamics too quickly (creating an audible pumping sound), while 
too slow a setting might affect the dynamics during the transition from a 
loud to a softer passage. Again, it's best to experiment with this setting to 
arrive at the slowest possible release that won't color the signal's sound. 

■ Meter display : This control changes the compressor's meter display to read 
the device's output or gain reduction levels. In some designs, there's no 
need for a display switch, as readouts are used to simultaneously display 
output and gain reduction levels. 

As previously stated, the use of compression (and most forms of dynamics 
Processing) is often misunderstood, and compression can easily be abused. 
Generally, the idea behind these processing systems is to reduce the overall 
dynamic range of a track, music or sound program or to raise its overall per- 
ceived level without adversely affecting the sound of the track itself. It's a well- 
known fact that overcompression can actually squeeze the life out of a 
performance by limiting the dynamics and reducing the transient peaks that can 
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give life to a performance. For this reason, it's important to be aware of the 
general nuances of the Controls we've just discussed. 

During a recording or mixdown session, compression can be used in order to 
balance the dynamics of a track to the overall mix or to keep the signáis from 
overloading preamps, the recording médium and your ears. Compression 
should be used with care for any of the following reasons: 

■ Minimize changes in volume that occur whenever the dynamics of an 
instrument or vocal are too great for the mix. As a tip, a good starting 
point might be a 0-dB threshold setting at a 4:1 ratio, with the attack and 
release Controls set at their middle positions. 

■ Smooth out momentary changes in source-to-mic distance. 

■ Balance out the volume ranges of a single instrument. For example, the 
notes of an electric or upright bass often vary in volume from string to 
string. Compression can be used to "smooth out" the bass line by match- 
ing their relative volumes. In addition, some instruments (such as horns) 
are louder in certain registers because of the amount of effort that's 
required to produce the notes. Compression is often useful for equalizing 
these volume changes. As a tip, you might start with a ratio of 5:1 with 
a medium-threshold setting, médium attack and slower release time. 
Overeompression should be avoided to avoid pumping effeets. 

■ Reduce other frequeney bands by inserting a filter into the compression 
chain that causes the circuit to compress frequencies in a specific band 
(frequency-selective compression). A common example of this is a de- 
esser, which is used to detect high frequencies in a compressor's circuit so 
as to suppress those "SSSS," "CHHH" and "FFFF" sounds that can distort 
or stand out in a recording. 

■ Reduce the dynamic range and/or boost the average volume of a mix so 
that it appears to be significantly louder (as occurs when a televisión com- 
mercial seems louder than the show). 

Although it may not always be the most important, this last application often 
gets a great deal of attention, because many producers strive to cut their record- 
ings as "hot" as possible. That is, they want the recorded levels to be as far above 
the normal operating level as possible without blatantly distorting. In this com- 
petitive business, the underlying logic behind the concept is that louder record- 
ings (when broadeast on Top 40 radio or podcast or played on a múltiple CD 
changer or an MP3 player) will stand out from the softer recordings in a playl- 
ist. In fact, reducing the dynamic range of a song or program's dynamic range 
will actually make the overall levels appear to be louder. By using a slight (or 
not-so-slight) amount of compression and limiting to squeeze an extra 1- or 
2-dB gain out of a song, the increased gain will also add to the perceived bass 
and highs because of our ears' increased sensitivity at louder levels (remember 
the Fletcher-Munson curve discussed in Chapter 2?). To achieve these hot levels 
without distortion, multiband compressors and limiters often are used during 
the mastering process to remove peaks and to raise the average level of the 
program. You'll find more on this subject in Chapter 18 (Mastering). 
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Compressing a mono mix is done in much the same way as one might compress 
a single instrument. Adjusting the threshold, attack, release and ratio Controls, 
however, is more critical in order to prevent the pumping of prominent instru- 
ments within the mix. Compressing a stereo mix gives rise to an additional 
problem: If two independent compressors are used, a peak in one channel will 
only reduce the gain on that channel and will cause sounds that are centered 
in a stereo image to shift (or jump) toward the channel that's not being com- 
pressed (since it will actually be louder). To avoid this center shifting, most 
compressors (of the same make and model) can be linked as a stereo pair. This 
procedure of ganging the two channels together interconnects the signal-level 
sensing circuits in such a way that a gain reduction in one channel will cause 
an equal reduction in the other (thereby preventing the center information from 
shifting in the mix). 

Before we move on, let's take a look at a few examples of the use of compres- 
sion in various applications. Keep in mind, these are only beginning suggestions 
... nothing can substitute for experimenting and finding the settings that work 
best for you and the situation: 

■ Acoustic guitar: A modérate degree of compression (3 to 8 dB) with a 
médium compression ratio can help to pulí an acoustic forward in a mix. 
A slower attack time will allow the string's percussive attack to pass 
through. 

■ Bass guitar: The electric bass is often a foundation instrument in pop and 
rock music. Due to variations in note levels from one note to another on 
an electric bass guitar (or upright acoustic, for that matter), a compressor 
can be used to even out the notes and add a bit of presence and/or punch 
to the instrument. Since the instrument often (but not always) has a 
slower attack, it's often a good idea to start with a médium attack (4:1, 
for example) and threshold setting, along with a slower release time 
setting. Harder compression of up to 10:1 with gain reductions ranging 
from 5 to 10 dB can also give a good result. 

■ Brass: The use of a faster attack (1 to 5 ms) with ratios that range from 
6:1 to 15 :1 and modérate to heavy gain reduction can help keep the brass 
in line. 

■ Electric guitar: In general, an electric guitar won't need much compression, 
because the sound is often evened out by the amp, the instrument's 
natural sustain character and processing pedáis. If desired, a heavier com¬ 
pression ratio, with 10 or more decibels of compression can add to the 
instrument's "bite" in a mix. A faster attack time with a longer release is 
often a good place to start. 

■ Kick drum and snare: These driving instruments often benefit from added 
compression. For the kick, a 4:1 ratio with an attack setting of 10 ms or 
slower can help emphasize the initial attack while adding depth and pres¬ 
ence. The snare attack settings might be faster, so as to catch the initial 
transients. Threshold settings should be set for a minimum amount of 
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reduction during a quiet passage, with larger amounts of gain reduction 
happening during louder sections. 

■ Synths: These instruments generally don't vary widely in dynamic range, 
and thus won't require much (or any) compression. If needed, a 4:1 ratio 
with modérate settings can help keep synth levels in check. 

■ Vocals: Singers (especially inexperienced ones) will often place the mic 
cióse to their mouths. This can cause wide volume swings that change 
with small moves in distance. The singer might also shout out a line just 
after delivering a much quieter passage. These and other situations lead 
to the careful need for a compressor, so as to smooth out variations in 
level. A good starting point would be a threshold setting of 0 dB, with a 
ratio of 4:1 with attack and release settings set at their midpoints. Gain 
reductions that fall between 3 and 6 dB will often sit well in a mix 
(although some rock vocalists will want greater compression) ... be careful 
of overcompression and its adverse pumping artifacts. Given digital's wide 
dynamic range, you might consider adding compression later within the 
mixdown phase, rather than during the actual session. 

■ Final mix compression: It's often a common practice to compress an entire 
mix during mixdown. If the track is to be professionally mastered, you 
should consult with the mastering engineer before the deed is done (or 
you might provide him or her with both a compressed and uncompressed 
versión). When applying bus compression, it is usually a good idea to start 
with médium attack and release settings, with a light compression ratio 
(say, 4:1). With these or your preferred settings, reduce the threshold 
detection until a light amount of compression is seen on the meter display. 
Levels of between 3 and 6 dB will provide a decent amount of compres¬ 
sion without audible pumping or artifacts (given that a well-designed unit 
or plug-in is used). 



do it yourself 


Tutorial: Compression 


1. Go to the “Tutorial” section of www.modrec.com and 
download the tutorial soundfiles that relate to 


sor begins to kick in. Can you hear a difference? Can 
you see a difference on the consolé or mixer meters? 


compression (which include instrument/music 
segments in various dynamic States). 


4. Set the levels and threshold to a level you like and 
then set the attack time to a slow setting. Now, select 
a faster setting and continué until it sounds natural. 
Try setting the release to its fastest setting. Does it 
sound better or worse? Now, select a slower setting 
and continué until it sounds natural. 


2. Listen to the tracks. If you have access to an editor or 
DAW, import the files and look at the waveform 


amplitudes for each example. If you’d like to DIY, then ... 


3. Record or obtain an uncompressed bass guitar track 
and monitor it through a compressor or compression 
plug-in. Increase the threshold level until the compres- 


5. Repeat the above routine and settings using a snare 
drum track. Were your findings different? 
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FIGURE 14.27 

Universal Audio’s 
UAD Precisión 
Multiband plug-in. 
(Courtesy of 
Universal Audio, 
www.uaudio.com. 

© 2009 Universal 
Audio, Inc. All rights 
reserved. Used with 
permission.) 



Multiband compression 

Multiband compression (Figure 14.27) works by breaking up the audible spec- 
trum into various frequency bandwidths through the use of múltiple bandpass 
filters. This allows each of the bands to be isolated and processed in ways that 
strictly minimize the problems or maximize the benefits in a particular band. 
Although this process is commonly done in the final mastering stage, multiband 
techniques can also be used on an instrument or grouping. For example: 

■ The dynamic upper range of a slap bass could be lightly compressed, while 
heavier amounts of compression could be applied to the instrument's 
lower register. 

■ An instrument's high end can be brightened simply by adding a small 
amount of compression. This can act as a treble boost while accentuating 
some of the lower-level high frequencies. 

More information on the subject can be found in Chapter 18 (Mastering). 

Limiting 

If the compression ratio is made large enough, the compressor will actually 
become a limiter. A limiter (Figure 14.28) is used to keep signal peaks from 
exceeding a certain level in order to prevent the overloading of amplifier signáis, 
recorded signáis onto tape or disc, broadcast transmission signáis, and so on. 
Most limiters have ratios of 10:1 (above the threshold, for every 10-dB increase 
at the input there will be a gain of 1 dB at the output) or 20:1 (Figure 14.29), 
although some have ratios that can range up to 100:1. Since a large increase 
above the threshold at the input will result in a very small increase at its output, 
the likelihood of overloading any equipment that follows the limiter will be 
greatly reduced. Commonly, limiters have three basic functions: 
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FIGURE 14.28 

Waves Ll 
Ultramaximizer 
limiting/quantization 
plug-in. (Courtesy of 
Waves Ltd., www. 
waves.com.) 



FIGURE 14.29 

The output of a 
limiter is linear below 
the threshold and 
follows a high input/ 
output gain reduction 
ratio (10:1, 20:1 or 
more) above this 
point. 


■ To prevent signal levels from increasing beyond a specified level: Certain types 
of audio equipment (often those used in broadcast transmission) are 
often designed to opérate at or near their peak output levels. Significantly 
increasing these levels beyond 100% would severely distort the signal and 
possibly damage the equipment. In these cases, a limiter can be used to 
prevent signáis from significantly increasing beyond a specified output 
level. 

■ To prevent short-term peaks from reducing a program's average signal level: 
Should even a single high-level peak exist at levels above the program's 
rms average, the average level can be significantly reduced. This is espe- 
cially true whenever a digital audio file is normalized at any percentage 
valué, because the peak level will become the normalized máximum valué 
and not the average level. Should only a few peaks exist in the file, they 
can easily be zoomed in on and manually reduced in level. If múltiple 
peaks exist, then a limiter should be considered. 
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■ To prevent high-level, high-frequency peaks from distorting analog tape: When 
recording to certain media (such as cassette and videotape), high-energy, 
transient signáis actually don't significantly add to the program's level, 
relative to the distortion that could result from their presence (if they 
saturated the tape) or from the noise that would be introduced into the 
program (if the signal was recorded at such a low level that the peaks 
wouldn't distort). 

LJnlike the compression process, extremely short attack and release times are 
often used to quickly limit fast transients and to prevent the signal from being 
audibly pumped. Limiting a signal during the recording and/or mastering phase 
should only be used to remove occasional high-level peaks, as excessive use 
would trigger the process on successive peaks and would be noticeable. If the 
program contains too many peaks, it's probably a good idea to reduce the level 
to a point where only occasional extreme peaks can be detected. 


DI Y 

do it yourself 


1. Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 14—Limiting” and download the soundfiles 
(which include instrument/music segments ¡n various 
States of limiting). 

2. Listen to the tracks. If you have access to an editor 
or DAW, import the files and look at the waveform 
amplitudes for each example. If you’d like to DIY, 
then ... 

3. Feed an isolated track or entire mix through a limiter 
or limiting plug-in. 


With the limiter switched out, tum the signal up until 
the meter begins to peg (you might want to tum the 
monitors down a bit). 

5. Now reduce the level and tum it up again ... this time 
with the limiter switched ¡n. Is there a point where the 
level stops ¡ncreasing, even though you’ve ¡ncreased 
the input signal? What does the gain reduction meter 
show? Decrease and increase the threshold level and 
experiment with the signal’s dynamics. What did you 
find out? 


Tutorial: Limiting 

4. 


Expansión 

Expansión is the process by which the dynamic range of a signal is proportion- 
ately increased. Depending on the system's design, an expander (Figure 14.30) 
can opérate either by decreasing the gain of a signal (as its level falls below 
the threshold) or by increasing the gain (as the level rises above it). Most 
expanders are of the first type, in that as the signal level falls below the expan¬ 
sión threshold the gain is proportionately decreased (according to the slope 
ratio), thereby increasing the signal's overall dynamic range (Figure 14.31). 
These devices can also be used as noise reducers. You can do this by adjusting 
the device so that the noise is downwardly expanded during quiet passages, 
while louder program levels are unaffected or only moderately reduced. As with 
any dynamics device, the attack and release settings should be carefully set to 
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FIGURE 14.30 

The Aphex Model 
622 Logic-Assisted 
Expander/Gate. 
(Courtesy of Aphex 
Systems, Inc., www. 
aphex.com.) 


FIGURE 14.31 

Commonly, the 
output of an 
expander is linear 
above the threshold 
and follows a low 
input/output gain 
expansión ratio 
below this point. 


best match the program material. For example, choosing a fast release time for 
an instrument that has a long sustain can lead to audible pumping effects. 
Conversely, slow release times on a fast-paced, transient instrument could cause 
the dynamics to return to its linear State more slowly than would be natural. 
As always, the best road toward understanding this and other dynamics pro- 
cesses is through experimentation. 

The noise gate 

One other type of expansión device is the noise gate (Figure 14.32). This device 
allows a signal above a selected threshold to pass through to the output at unity 
gain and without dynamic processing; however, once the input signal falls 
below this threshold level, the gate acts as an infinite expander and effectively 
mutes the signal by fully attenuating it. In this way, the desired signal is allowed 
to pass while background sounds, instrument buzzes, leakage or other unwanted 
noises that occur between pauses in the music aren't. Here are a few reasons 
why a noise gate might be used: 

■ To reduce leakage between instruments. Often, parts of a drum kit fall into 
this category; for example, a gate can be used on a high-tom track in order 
to reduce excessive leakage from the snare. 

■ To eliminate noise from an instrument or vocal track during silent 
passages. 

The general rules of attack and release apply to gating as well. Fortunately, these 
settings are a bit more obvious during the gating process than with any other 
dynamic tool. Improperly set attack and release times will often be immediately 
obvious when you're listening to the instrument or vocal track (either on its 
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FIGURE 14.32 

Noise gates are 
commonly included 
within many dynamic 
plug-in processors. 
(Courtesy of 
Steinberg Media 
Technologies, GmbH, 
www.steinberg.net.) 
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own or within a mix) because the sound will cut in and out at inappropriate 
times. 

Commonly, a key input (as previously shown in Figure 14.6) is included as a 
side-chain path to a noise gate. A key input is an external control that allows 
an external analog signal source (such as a miked instrument, synthesizer or 
oscillator) to trigger the gates audio output path. For example, a mic or recorded 
track of a kick drum could be used to key a low-frequency oscillator. Whenever 
the kick sounds, the oscillator will be passed through the gate. By combining 
the two, you can have a deep kick sound that'll make the room shake, rattle 
and roll. 






































































Signal Processing 4APTER 


TIME-BASED EFFECTS 

Another important effects category that can be used to alter or augment a signal 
revolves around delays and regeneration of sound over time. These time-based 
effects often add a perceived depth to a signal or change the way we perceive 
the dimensional space of a recorded sound. Although a wide range of time- 
based effects exist, they are all based on the use of delay (and/or regenerated 
delay) to achieve such results as: 

■ Time-delay or regenerated echoes, chorus and flanging 

■ Reverb. 

Delay 

One of the most common effects used in audio production today alters the 
parameter of time by introducing various forms of delay into the signal path. 
Creating a delay circuit is a relatively simple task to accomplish digitally. 
Although dedicated delay devices (often referred to as digital delay lines, or 
DDLs) are readily available on the market, most multifunction signal processors 
and time-related plug-ins are capable of creating this straightforward effect 
(Figure 14.33). In its basic form, digital delay is accomplished by storing 
sampled audio directly into RAM. After a defined length of time (usually mea- 
sured in milliseconds), the sampled audio can be read out from memory for 
further processing or direct output (Figure 14.34). Llsing this basic concept, a 
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FIGURE 14.33 

Pro Tools Mod Delay 
II. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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FIGURE 14.34 

A digital delay device 
stores sampled audio 
into RAM, where it 
can be read out at a 
later time. 
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FIGURE 14.35 

ln certain instances, 
doubling can fool 
the brain into 
thinking that more 
instruments are 
playing than actually 
are. 


wide range of effects can be created simply by assembling circuits and program 
algorithms into blocks that can introduce delays or regenerated echo loops. Of 
course, these circuits will vary in complexity as new blocks are introduced. 

DELAY IN ACTION! 

Less than 15 ms 

Probably the best place to start looking at the delay process is at the sample 
level. By introducing delays downward into the microsecond (one millionth of 
a second) range, control over a signal's phase characteristics can be introduced 
to the point where selective equalization actually begins to occur. In effect, 
controlling very short-term delays is actually how EQ is carried out in the digital 
domain! 

Whenever delays that fall below the 15-ms range are slowly varied over time 
and then are mixed with the original undelayed signal, an effect known as 
combing is created. Combing is the result of changes that occur when equalized 
peaks and dips appear in the signal's frequency response. By either manually or 
automatically varying the time of one or more of these short-term delays, a 
constantly shifting series of effects known as flanging can be created. Depending 
on the application, this effect (which makes a unique "swishing" sound that's 
often heard on guitars or vocals) can range from being relatively subtle to having 
modérate to wild shifts in time and pitch. It's interesting to note the differences 
between the effects of phasing and flanging. Phasing uses all-pass filters to create 
uneven peaks and notches, whereas flanging uses delay lines to create even 
peaks and notches ... although, the results are somewhat similar. 

15 to 35 ms 

By combining two identical (and often slightly delayed) signáis that are slightly 
detuned in pitch from one another, an effect known as chorusing can be created. 
Chorusing is an effects tool that's often used by guitarists, vocalists and other 
musicians to add depth, richness and harmonic structure to their sound. Increas- 
ing delay times into the 15- to 35-ms range will create signáis that are spaced 
too closely together to be perceived by the listener as being discrete delays. 
Instead, these closely spaced delays create a doubling effect when mixed with an 
instrument or group of instruments (Figure 14.35). In this instance, the delays 
actually fool the brain into thinking that more instruments are playing than 
actually are ... subjectively increasing the sound's density and richness. This 
effect can be used on background vocals, horns, string sections and other 
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grouped instruments to make the ensemble sound as though it has doubled (or 
even tripled) its actual size. This effect also can be used on foreground tracks, 
such as vocals or instrument solos, to create a larger, richer and fuller sound. 
Some "chorus" delay devices introduce slight changes in delay and pitch shift- 
ing, allowing for detunings that can create an interesting, humanized sound. 

Should time or budget be an issue, it's also possible to create this doubling 
effect by actually recording a second pass to a new set of tracks. Using this 
method, a 10-piece string section could be made to sound like a much larger 
ensemble. In addition, this process automatically gives vocals, strings, keyboards 
and other legato instruments a more natural effect than the one you get by using 
an electronic effects device. This having been said, these devices can actually go 
a long way toward duplicating the effect. Some delay devices even introduce 
slight changes in delay times in order to create a more natural, humanized 
sound. As always, the method you choose will be determined by your style, your 
budget and the needs of your particular project. 

More than 35 ms 

When the delay time is increased beyond the 35- to 40-ms point, the listener 
will begin to perceive the sound as being a discrete echo. When mixed with the 
original signal, this effect can add depth and richness to an instrument or range 
of instruments that can really add interest to an instrument within a mix. 

Adding delays to an instrument that are tied to the tempo of a song can go even 
further toward adding a degree of depth and complexity to a mix. Most delay- 
based plug-ins make it easy to inserí tempo-based delays into a track, often by 
simply pressing a button. For hardware delay devices, it's usually necessary to 
calcúlate the tempo math that's required to match the session. Here's the simple 
math for making the calculations: 


60,000/tempo = time (in ms) 60,000/100 = 600 ms 

For example, ¡f a song’s tempo is 100 bpm (beats per Using divisions of this figure (300, 150, 75, etc.) would 

minute), then the amount of delay needed to match the insert delays at 1/2, 1/4, 1/8th measure intervals, 

tempo at the beat level would be: 


Caution should be exercised when adding delay to an entire musical program, 
because the program could easily begin to sound muddy and unintelligible. By 
feeding the delayed signal back into the circuit, a repeated series of echo ... 
echo ... echoes can be made to simúlate the delays of yesteryear—you'll defi- 
nitely notice that Elvis is still in the house. 



Time-Based Effects 


01 y y 

do it yourself 

Tutorial: Delay 


1. Go to the “Tutorial” section of www.modrec.com, click 
on “Ch. 14—Delay” and download the soundfile (which 
ineludes segments with varying degrees of delay). 

2. Listen to the track. If you’d like to DIY, then ... 

3. Insert a digital delay unit or plug-in into a program 
channel and balance the dry track’s output mix so that 
the input signal is set equally with the delayed output 
signal. (Note: If there is no mix control, route the delay 
unit’s output to another input strip and combine 
delayed/undelayed signáis at the consolé.) 

4. Listen to the track with the mix set to listen equally to 
the dry and effected signal. 


5. Vary the settings over the 1- to 10-ms range. Can you 
hear any rough EQ effects? 

6. Manually vary the settings over the 10- to 35-ms 
range. Can you simúlate a rough phasing effect? 

7. Increase the settings above 35 ms. Can you hear the 
discrete delays? 

8. If the unit has a phaser setting, tum it on. ... How 
does it sound different? 

9. Now change the delay settings a little faster to create 
a wacky flange effect. If the unit has a flange setting, 
tum it on. Try playing with the time-based settings that 
affect its sweep rate. ... Fun, huh? 


Reverb 

In professional audio production, natural acoustic reverberation is an extremely 
important tool for the enhancement of music and sound production. A properly 
designed acoustical environment can add a sense of space and natural depth to 
a recorded sound that'll often affect the performance as well as its overall sonic 
character. In situations where there is little, no or substandard natural ambience, 
a high-quality reverb device or plug-in (Figure 14.36) can be extremely helpful 
in filling the production out and giving it a sense of dimensional space and 
perceived warmth. In fact, reverb consists of closely spaced and random múl¬ 
tiple echoes that are reflected from one boundary to another within a deter- 
mined space (Figure 14.37). This effect helps give us perceptible cues as to the 
size, density and nature of a space (even though it might have been artificially 
generated). These cues can be broken down into three subcomponents: 

■ Direct signal 

■ Early reflections 

■ Reverberation. 


The direct signal is heard when the original sound wave travels directly from the 
source to the listener. Early reflections is the term given to those first few reflec¬ 
tions that bounce back to the listener from large, primary boundaries in a given 
space. Generally, these reflections are the ones that give us subconscious cues 
as to the perception of size and space. The last set of reflections makes up the 
signal's reverberation characteristic. These sounds are comprised of zillions of 
random reflections that travel from boundary to boundary within the confines 
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FIGURE 14.36 
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FIGURE 14.37 

Signal level versus 
reverb time. 


of a room. These reflections are so closely spaced in time that the brain can't 
discern them as individual reflections, so they're perceived as a single, densely 
decaying signal. 

REVERBTYPE5 

By varying program and setting parameters, a digital reverb device can be used 
to simúlate a wide range of acoustic environments, reverb devices and special 
effeets. A few popular categories inelude: 

■ Hall: Simulates the acoustics of a concert hall. This is often a diffuse, lush 
setting with a longer RT60 decay time (the time that's required for a sound 
to decay by 60 dB). 
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■ Chamberí Simulates the acoustics of an echo chamber. Like a live chamber, 
these settings often simúlate the brighter reflectivity of tile or cement 
surfaces. 

■ Room: As you might expect, these settings simúlate the acoustics of a mid- 
to large-sized room. It's often best suited to intimate solo instruments or 
a chamber atmosphere. 

■ Live (stage): Simulates a live performance stage. These settings can vary 
widely but often simúlate long early-delay reflections. 

■ Spring: Simulates the low-fidelity "boingyness" of yesteryear's spring reverb 
devices. 

■ Píate: Simulates the often-bright diffuse character of yesteryear's metallic 
píate reverb devices. These settings are often used on vocals and percussion 
instruments. 

■ Reverse: These backward-sounding effects are created by reversing the decay 
trail's envelope so that the decay increases in level over time and is quickly 
cut off at the tail end, yielding a sudden break effect. This can also be 
realistically created in a DAW by reversing a track or segment, applying 
reverb, and then reversing it again to yield a true backward reverb trail. 

■ Gate: Cuts off the decay trail of a reverb signal. These settings are often 
used for emphasis on drums and percussion instruments. 
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do it yourself 

Tutorial: Reverb Types 

1. Go to the “Tutorial” section of www.modrec.com, click 2. Listen to the track. 
on “Ch. 14—Reverb Types” and download the 
soundfile. 


Psychoacoustic enhancement 

A number of signal processors rely on psychoacoustic cues in order to fool the 
brain into perceiving a particular effect. The earliest and most common of these 
devices are those that enhance the overall presence of a signal or entire record- 
ing by synthesizing upper-range frequency harmonics and inserting them into 
a mix in order to brighten the perceived sound. Although the additional har¬ 
monics won't significantly affect the program's overall volume, the effect is a 
marked increase in its perceived presence. Other psychoacoustic devices that 
make use of complex harmonic, phase, delay and equalization parameters have 
become standard production tools in the field of mastering in order to shape 
the final sound into one that's interesting, with a sonic character all its own. 

In addition to synthesizing harmonics in order to change or enhance a record- 
ing or track, other digital psychoacoustic processors deal exclusively with the 
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subject of spatialization (the placement of an audio signal within a three- 
dimensional acoustic field), even though the recording is being played back over 
stereo speakers. By varying the parameters of a stereo or múltiple input source, 
this processing function creates phase and amplitude paths that can fool the 
brain into perceiving that the stereo image is actually emanating from a sound- 
field that's wider than the physical speaker positions. In practice, care should 
be taken when using these devices, because the effect is often carried off with 
degrees of success that vary from system to system. In addition, the use of phase 
relationships to expand the stereo soundfield can actually cause obvious cancel- 
lation problems when the program is listened to in mono. 

Pitch shifting 

Ever had a perfectly good vocal take that was spoiled by just one or two fíat 
notes? Or had a project come in the door with a guitar track that was out of 
tune? Or needed to change the key on a 30-second radio spot? It's times 
like these that pitch shifting can save your day! Pitch shifting can be used to vary 
the pitch of a signal or soundfile (either upward or downward) in order to 
transpose the relative pitch of an audio program without affecting its duration. 
This process can take place in either real time or non-real time. Pitch shifting 
works by writing sampled audio data to a temporary memory, where it's 
resampled to either a higher or a lower sample rate (according to the desired 
final pitch). Once this is done, the processor either adds interpolated samples 
to (lowers the pitch) or subtracts them from (raises the pitch) the resampled 
data to return it back to the original output rate, while keeping the altered 
pitch intact. Figure 14.38 gives two basic examples of how this is often 
carried out. 

A degree of caution should be used when changing the pitch of a program or 
audio segment. Whenever uneven or minute interval changes are made, the 
interpolation of samples doesn't always fall perfectly into place. This can lead 
to digital artifacts that add unacceptable amounts of harmonic distortion. If the 
track is in the background, there shouldn't be a problem; however, care should 
be taken with upfront instruments and vocals. It's important to keep in mind 
that large pitch changes might be more noticeable. As always, your ears are the 
best judge. 

Time and pitch changes 

By combining variable sample rates and pitch shifting techniques, it's possible 
to create three different variations: 

■ Time change: A program's length can be altered, without affecting its pitch, 
by raising or lowering its playback sample rate. 

■ Pitch change: A program's length can remain the same while pitch is shifted 
either up or down. 

■ Both: Both a program's pitch and length can be altered by means of simple 
resampling techniques. 
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FIGURE 14.38 

Two pitch shift 
examples with an 
initial 1-kHz digital 
signal and a sample 
rate of 44.1 kHz: 

(a) The signal can be 
halved in pitch (to 
500 Hz) by internally 
downsampling to a 
new rate of 22.05 k. 
To return the output 
rate to 44.1 (while 
retaining the 500-Hz 
pitch), new sample 
points must be 
added into each 
dropped position. 

(b) The signal can be 
doubled in pitch (to 

2 kHz) by internally 
upsampling to a new 
rate of 88.2 k. To 
return the output 
rate to 44.1 (while 
retaining the 2-kHz 
pitch), every other 
sample point must 
be dropped. 
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These functions have become an important part of the signal processing and 
music production arsenals that are used by the audio-for-video, film and broad- 
cast industries. These tools help give producers control over the running time 
of film video and audio soundtracks while maintaining the original, natural 
pitch of voice, music and effects. For example, using a DAW, we could add a 
5-second tráiler onto the end of an existing 30-second public Service radio spot 
simply by time compressing the 30-second spot to 25 seconds (while keeping 
the pitch intact) and then adding the tráiler. 

In addition to the basic time/pitch techniques that are commonly used in music 
production (most often by electronic musicians), this technology has allowed 
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for the huge explosión in loop-based music composition and production. These 
popular programs and music plug-ins involve the use of recorded soundfiles 
that are encoded with headers that inelude information on their native tempo 
and length (in both samples and beats). When you set the loop program to a 
master tempo (or a specific tempo at that point in the score), a loop segment, 
once imported, can go about the process of recalculating its pitch and tempo 
to match the current session tempo and—voilá! The file's in sync with the song! 
Further info on loop-based production tools can be found in Chapter 8 (Groove 
Tools and Techniques). 

MULTIPLE-EFFECTS DEVICES 

Since most digital signal processors are by nature multifunctional chameleons, 
it follows that most hardware and certain plug-in processors can be easily pro- 
grammed to perform various functions. For this reason, many digital Systems 
have been designed to perform as multiple-effects devices (Figure 14.39). Múl¬ 
tiple effeets, in this case, can have several basic meanings: 

■ A single device might offer a wide range of processing functions but allow 
only one effect to be called up at a time. 

■ A single device might offer a range of processing functions that can be 
"stacked" to perform a number of simultaneous effeets. 

■ An effeets device might have múltiple ins and outs, each of which 
can perform several processing functions (effectively giving you 
múltiple processors that can be used in a multichannel mixdown 
environment). 



FIGURE 14.39 

Lexicón 960L 
multi-effects digital 
effeets system. 
(Courtesy of Lexicón, 
Inc., www.lexicon. 
com.) 
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DYNAMIC EFFECTS AUTOMATION AND EDITING 

One of the joys of working with effects is the ability to manipúlate and vary 
effects parameters in real time over the duration of a song or audio program. 
By altering parameters, changing settings and mixing effects levels ... the subtle 
variations in expression can add a great deal of interest to a project. 

The ability to dynamically automate effects settings can be accomplished in any 
number of ways, including: 

■ Via MIDI control and parameter change messages 

■ Via external hardware controller 

■ Via DAW or other form of automation control. 

In closing, an almost unlimited degree of effects control is available to us 
through most high-level (and many entry-level) digital audio workstations. 
Through the use of any of the readily available hardware controllers or on-screen 
automation Controls, it's possible to manipúlate and automate effects within a 
DAW session with an amazing degree of sophistication and ease. 
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do it yourself 

Tutorial: DAW Effects Automation 


The vast majority of plug-in effects can be directly and 
dynamically automated within the computer’s DAW 
program. It’s my greatest hope that you’ll: 

■ Take a look at your favorite DAW manual and start 
reading! 

■ Open up a tutorial session or better yet, make your 
own. 


■ Cali up some effects on various tracks. 

■ Cali up some automation control parameters and start 
grabbing Controls. 

■ Leam how to edit these automation functions, so as to 
be able to finesse your effects in new and interesting 
ways. 


Go ahead, get hold of these fun and effective tools ... and experiment your 
heart out! 


CHAPTER15 


Noise Reduction 


With the advent of digital audio, the CD, newer microphone technologies and 
surround-sound home theaters, an increase in dynamic range and a demand 
for better quality sound have been on the rise. Because of this, it's more impor- 
tant than ever for those in audio production to pay cióse attention to the back- 
ground noise levels that are produced by amplifier self-noise, analog magnetic 
tape and the like. Although the overall dynamic range of human hearing roughly 
encompasses a full 140 dB, this often can't be fully captured by a recording or 
delivery médium for several reasons: 

■ An acoustic or electronic weak link in the chain might be introducing 
noise and/or restricting the program's dynamic range. 

■ The médium itself might be incapable of encoding a wide dynamic range. 

These scenarios can easily crop up when transmitting a signal through the air- 
waves or a telephone line, when a poorly designed device is used in the signal 
chain or when analog tape tracks are used. Although the Solutions to reducing 
noise that are induced into an analog path can be quite similar, the first part of 
this chapter will largely focus on the reduction of tape noise, which is a natural 
by-product of the analog recording/playback process. 

Not all the blame for added noise can be placed on our older technology 
friends. Even though a 16-bit digital recording has a theoretical dynamic range 
of 96 dB and a 24-bit system can actually encode 144 dB, noises can (and often 
will) crop up from such modern-day gremlins as mic preamps, effects and out- 
board gear, analog communication lines and poorly designed digital audio 
converters, not to mention from recordings that were made under adverse and/ 
or noisy acoustic conditions. For these reasons, the latter half of the chapter will 
deal with single-ended and digital noise reduction processes that can help 
reduce noises that have been introduced into the recording chain from any 
number of gremlin sources. 

ANALOG NOISE REDUCTION 

The roughly 60-dB signal-to-noise (S/N) limitation that's imposed on conven- 
tional analog ATR and VTR audio tracks is dictated by tape hiss (which is heard 
when the overall recorded level of the program is too low). Should an optimum 



Analog Noise Reduction 


level produce an unacceptable amount of noise, the engineer is faced with 
several options: record at a higher level (with the possibility of increased distor- 
tion), change the signal's overall dynamic range by raising low-level signáis 
above the noise (compression or limiting) or introduce a system into the chain 
that can alter the dynamic range of the recording médium itself. 

Analog tape noise might not be much of a problem when dealing with one or 
two tracks in an audio production, but the combined noise and other distor- 
tions that can occur when 8, 16, 24 or 48 tracks are combined can range from 
being bothersome to downright unacceptable. The following types of noises are 
often major contributors to the problem: 

■ Tape and amplifier noise 

■ Crosstalk between tracks 

■ Print-through 

■ Modulation noise. 


Modulation noise is a high-frequency component that causes sonic "fuzziness" 
by introducing sideband frequencies that can distort the signal (Figure 15.1). 
This noise-based distortion is due to the magnetic and mechanical properties 
of the analog recording process itself, and actually increases as recorded levels 
rise. This noise is often higher in level than you might expect, and when com¬ 
bined with asperity noise (sideband frequencies that are also introduced by the 
analog record/playback process) can definitely play a role in what could be 
called the "analog sound." 
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do it yourself 

Tutorial: Analog Tape Modulation and Asperity Noise 

1. Feed a 0-VU, 1-kHz test tone to a track on a profes- 3. Switch the recorder to monitor the tone from the 

sional analog recorder. track’s playback (tape) head. Does ¡t sound different? 

2. Listen to the recorder’s source (input) signal through 
the monitors at a modérate level. 


FIGURE 15.1 

Modulation noise of a 
recorded analog sine 
wave. 
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These analog-based noises can be reduced to acceptable levels by using different 
combinations of the following actions: 

■ Improving the dynamic range of professional recording tape and/or 
increasing tape speed in order to record at higher flux levels 

■ Using an ATR with wider recorded tracks (i.e., allowing for higher record/ 
playback levels and reduced crosstalk specs 

■ Using a thicker tape base to reduce print-through. 

By using tape formulations that combine low noise and high output (which 
have generally increased the S/N ratio by 3 dB or more), noise levels can be 
reduced even further. However, when all's said and done, making most or all 
of the above improvements is simply too costly and impractical. In the end, it's 
important to realize that noise is simply an inherent part of the analog record¬ 
ing process ... and the goal is to minimize its effects. 

The compansion process 

To reduce the inherent effects of noise on analog recording, a process called 
compansion was created. The compansion encode/decode process gets its ñame 
from the fact that the incoming signal is first (COMP)ressed "before" being 
recorded onto tape. Then on playback, the output signal is downwardly 
exp(ANDED) in a reciprocal fashion back to its original dynamic range (the 
tape noise that's introduced by the machine itself is also downwardly expanded 
... and is therefore likewise reduced in level, while leaving the program mate- 
rial's original dynamics intact). Because this is a reciprocal process, the encoder 
must be placed between the consolé (or signal source) and the tape track during 
the recording process; likewise, the decoder must be placed between the tape 
output and the consolé (or desired destination) upon playback. 

To better understand this process, let's take a look at Figure 15.2. In this 
example, an input signal's overall dynamic range is restricted in dynamic range 
(COMPressed) before being sent to the recorder's track input. This is done so 
that the newly compressed signal can be recorded at levels that are much higher 
than the tape noise. During playback, the recorded signal is then downwardly 
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FIGURE 15.2 

Example of a 
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noise-reduction 
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expANDED back to its original dynamic range. Since the compressed signal is 
then recorded at a level well above the tape track noise, the process of expand- 
ing the signal back to it's original dynamic range will have the effect of turning 
the noise down to a point well below its original level. 

The two most common analog compansion noise reduction units that might 
be encountered are the dbx and Dolby noise-reduction systems. 

The dbx system is a full-bandwidth compansion system that can provide between 
20 and 30 dB of noise reduction. The compression chain uses a 2:1 ratio across 
the entire frequency bandwidth between the levels of-90 and +25 dBm (with 
its unity gain point occurring at +4 dBm or 0 VU). Because all signáis are com¬ 
pressed and expanded at a 2:1 ratio (regardless of signal level), the system isn't 
overly sensitive to variations in level (although it can be sensitive to frequency 
calibrations). 

A 2:1 ratio was chosen over a higher valué because of the effect that tape drop- 
outs might have on the expansión process. Because the changes in level are 1:2 
expanded on playback, a tape dropout of 2 dB would ultimately result in a level 
drop of 4 dB. A 1:2 ratio would cause a 6-dB dropout, and while a 1:1.5 expan¬ 
sión would make the dropout problem less noticeable; it would do so at the 
cost to the noise reduction process (only 20 dB would be possible). Therefore, 
a 2:1 compansion ratio was considered a compromise between the best noise 
reduction level and oversensitivity to dropouts. 

The Dolby SR noise-reduction system operates by breaking the audio signal into 
various frequency bandwidths. This makes it possible for frequency ranges that 
contain louder passages to be unaffected, while those containing softer passages 
(especially those in the upper noise-perception range) can be appropriately 
processed. The Dolby spectral recording (SR) process is more complicated than 
that of dbx, in that its signal-shaping processor uses a side-chain circuit that 
runs parallel to the device's main audio path. Although most of the Dolby SR 
electronics are used for spectral analysis, its principal operating system consists 
of five groups of fixed- and sliding-band filters with gentle slopes that are 
arranged by level and frequency to improve companded tape noise figures by 
as much as 24 dB. 


SINGLE-ENDED NOISE-REDUCTION PROCESS 

Since a compansion system must process a signal before recording and after 
playback to achieve noise reduction, noise can't be removed from the original 
source material: It can only be prevented from being introduced by the record¬ 
ing médium itself. The adaptive fílter or single-ended noise-reduction process is 
different. It extracts noise from an audio source by combining a downward 
dynamic-range expander with a variable low-pass filter. In other words, a single- 
ended device can be used to dynamically analyze, process and EQ an existing 
program to reduce the noise content with little or no audible effects (or the best 
possible compromise, in extreme cases). 
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FIGURE 15.3 

Behringer’s Denoiser 
system Model SNR 
2000. (Courtesy of 
Behringer Int’l GmbH, 
www.behringer.de.) 


Single-ended noise reduction systems work by breaking up the audio spectrum 
into a number of frequency bands, such that whenever the signal level within 
each band falls below a user-defined threshold, the signal will be attenuated. 
This downward expansion/filtering process accomplishes noise reduction by 
taking advantage of two basic psychoacoustical principies: 

■ Music is capable of masking lower level noise that exists within the same 
bandwidth. 

■ Reducing the bandwidth of an audio signal will reduce the perceived 
noise. 

It's a psychoacoustical fact that our ears are more sensitive to noises that contain 
a greater number of frequencies than to those containing fewer frequencies. 
Thus, whenever the program's high-frequency content is reduced or restricted 
to a certain bandwidth, the dynamic filtering process will sense this and reduce 
the frequency bandwidth accordingly (thereby reducing the noise content). 
When the program's high-frequency signal returns, the filter will again pass the 
frequency bandwidth up as far as necessary to pass the signal (allowing the 
increased program content will mask the background noise). 

These single-ended noise-reduction systems are currently available in both 
analog and digital flavors. In addition to the handful of stand-alone hardware 
devices that are on the market (Figure 15.3), single-ended noise reduction can 
also be found as a preset algorithm on a number of hardware multi-effects 
processors. Single-ended software plug-ins are also available for use with DAW 
systems (Figure 15.4). 

The noise gate 

A noise gate (Figure 15.5) can also be a very effective tool for eliminating noise 
from a track or tracks within a mix. This device allows a signal above a selected 
threshold to pass through to the output at unity gain (no gain change) and 
without dynamic processing. Once the input signal falls below this threshold 
level, however, the gate acts as an infinite expander and effectively mutes back¬ 
ground noises and other unwanted sounds by attenuating them. 

When "gating" a specific track, it's often necessary to take time out to fine-tune 
the device's attack and release Controls. This is done to reduce or elimínate any 
unwanted "pumping" or "breathing" as the noise or leakage signal falls and rises 
around the threshold point. The general rules for these settings are the same as 
those that apply to gain-change processors (see the compressor, limiter and 
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FIGURE 15.4 

DeNoiser Noise app 
within Cubase/ 
Nuendo. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 
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expander settings section in Chapter 14). Fortunately, these settings are often 
audibly more obvious than with any other dynamic tool. Setting the attack and 
release times either too fast or too slow will be immediately audible because 
the sound will cut in and out at inappropriate times. For this reason, care should 
be taken when adjusting the settings by both listening to the track on its own 
(solo track) and by listening to it within the context of the full mix. 

DIGITAL NOISE REDUCTION 

Digital signal processing (DSP) is commonly used to reduce noise levels within 
a recorded soundfile. These noises inelude artifaets such as tape hiss, hum, 
obtrusive background ambience, needle ticks, pops and even certain types of 
distortion that are present in the original recording. Although stand-alone 
digital noise processors do exist, most systems exist as plug-in software applica- 
tions for computer-based digital audio editors and workstations. These algo- 
rithms can be used to reduce the noise on one or more recorded tracks within 
a multitrack session, to clean up the overall noise on a mix during the master- 
ing phase or to clean up and restore older 45s, LPs and analog tapes for trans- 
fer to CD. 

Fast Fourier transf orm 

The most commonly used noise reduction programs make use of a mathemat- 
ically intense algorithm known as Fast Fourier Transf orm (FFT). These applica- 
tions and plug-ins (Figure 15.6) are able to analyze the amplitude/frequency 
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FIGURE 15.5 

Noise gates are 
commonly included 
within many dynamic 
plug-in processors. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 


domain of an audio signal in order to reduce hum, tape hiss and other extrane- 
ous noises from your recordings. This digital analysis generally begins by taking 
a digital "snapshot" of a short snippet of the offending noise (a brief section 
that contains only the noise will yield the best results). This noise témplate can 
then be digitally subtracted from the original soundfile or segment in varying 
amounts (and under the control of various program parameters), such that the 
footprint noise is reduced, while (under the best of conditions) the original 
program material is left intact and unaffected. 

Most modern FFT-based noise reduction plug-ins are able to process tracks 
or an entire mix in either real-time during playback and mixdown or, less 
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FIGURE 15.6 

Digidesign DINR 
noise-reduction 
plug-in. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 
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commonly, in non-real time (allowing the processed file to be written off-line, 
without taxing the computer's CPU during playback). 

Before closing this section, it's important that we briefly discuss a few of the 
unfortunate artifacts that can occur when using FFT-based noise reduction. The 
most notable of these is "chirping." This audible artifact most often occurs when 
too much FFT processing is applied. It literally sounds like a flock of small 
chirping birds that can either be heard in the background or in an obnoxious 
way that sounds like a bad Alfred Hitchcock movie. If you find yourself running 
for cover, it's best to pulí back on the FFT settings (and/or increase the process¬ 
ing quality level) until the artifacts are less noticeable. 

Should chirping and/or bandwidth limitations become a problem, you might 
consider using a single-ended noise reduction plug-in instead. Because these 
algorithms use an adaptive filter to intelligently change the program's band¬ 
width, no chirping artifacts will be introduced. 

Finally, it's a misconception that an FFT-based noise reduction application can 
only be used for reducing noise. Literally, ANY sound can be used as a footprint, 
and as a result, vocal formants, snare hits, etc., can be pulled from a soundfile 
to create unique and interesting effects. The sky's literally the limit! 
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From the above, it's easy to tell that the noise-reduction process isn't an exact 
Science, but more of a balancing act. Because of these and other band-limiting 
artifacts, you might strongly consider keeping your original soundfile intact, 
while writing the processed signal to a new track ... thereby keeping your 
options open for future changes, decisions and/or technological advances. 
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Click/pop eliminator 
application within 
Adobe’s Audition 3. 
(Courtesy of Adobe 
Systems, Inc., www. 
adobe.com.) 
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RESTORATION 

In addition to removing noise, programs and DAW plug-ins also exist for remov- 
ing clicks and pops from vinyl and older recordings (Figures 15.7). Although FFT 
analysis is often involved in the process, click removal differs slightly from FFT 
noise reduction. This multistep process begins by detecting high-level clicks (or 
those exceeding a user-defined threshold) that exist within a soundfile or defined 
segment. Once the offending noises are detected, the program performs a fre- 
quency analysis (both before and after the click) and then goes about the busi- 
ness of making a plausible guess as to what the damaged amplitude/frequency 
content should sound like. Finally, the calculated sound is pasted over the nasty 
offender—ideally rendering it less noticeable or gone—then moves onto the 
next click and starts the process over. 

Because click and pop noises can be different in nature from each other (both 
in duration and frequency makeup), noise-reduction plug-ins might offer appli- 
cations that are specifically suited to reducing either clicks or pops. In addition, 
certain programs also offer FFT-based applications that allow offending sounds 
(such as a cough, noisy music stand, etc.) to be visually erased in an on-screen 
drawing environment. Such tools allow us to tackle noise and other offenders 
in new and interesting ways. 
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Monitoring 


Within the recording process, our ability to judge and adjust sound is primarily 
based on what's heard through the monitor speakers in a project studio or 
control room environment (Figure 16.1). In fact, within the audio and video 
industries, the word monitor refers to a device that acts as a subjective pro- 
fessional standard or reference by which program material can be critically 
evaluated. 

Despite steady advances in design, speakers are still one of the weakest links in 
the audio chain. This weakness is generally due to potential nonlinearities that 
can exist in a speaker system's frequency response. In addition, interactions with 
a room's frequency response can often lead to peaks and dips that affect a 
speakers sonic character in ways that are difficult to predict. Add to this the 
factors of personal "tastes" in the sound, size and design types, and you'll 
quickly find that speakers are also one of the most subjective tools in a produc- 
tion environment. 

SPEAKER AND ROOM CONSIDERATIONS 

llnless you have several rooms with precisely matching dimensions, materials 
and furnishings (an unlikely scenario), you can bet your bottom buck that the 
same speaker system will sound different in different room environments. That's 
to say, it will interact with the various complex environmental factors to exhibit 
a unique frequency response curve and delay characteristic. 

Although variations between production rooms often play a big part in giving 
a facility its own particular sound, extreme variations in a room's frequency 
response can lead to difficulties that can be heard in the final product. For this 
reason, certain basic principies (which are covered in the Symmetry in Control 
Room Design section in Chapter 3) have become common knowledge to many 
who attempt the art of control room design. A few examples inelude: 

■ Reducing standing waves to help reduce erratic frequency response 
characteristics within a room 

■ Reducing excessive bass buildup in room corners through the use of 
bass traps 



Speaker and Room Cons derations 


FIGURE 16.1 

Example of a 
professional 
monitoring system: 
Paragon Studios, 
Franklin, TN. 
(Courtesy of Solid 
State Logic LTD, 
www.solid-state- 
logic.com.) 



■ Keeping the room/equipment layout symmetrical throughout a room so 
the left/right and front/rear imaging is consistent 

■ Using absorptive and reflective surfaces to help "shape" a room's sonic 
character. 

Because of the untold number of acoustic variables that are involved, a project 
that's been recorded in one facility will often sound quite different when played 
and/or mixed in another, even when high acoustical construction standards are 
followed. Fortunately for us, production and mixdown room designs have 
greatly impro ved over the last few decades. This is largely due to an added 
awareness of careful room design and the increased availability of acoustical 
producís that can help shape a room's sound. 

Beyond careful acoustic design and construction, a professional facility might 
choose to further reduce variations in frequency response by tuning (equalizing) 
its speakers to the room's acoustics so that the adjusted frequency response 
curve will be reasonably fíat and, therefore, reasonably compatible with most 
other control rooms. Tuning a speaker system to a room can be carried out in 
either of two ways: 

■ Altering settings on the speaker itself 

■ Equalizing the monitor output lines. 

One of the simplest ways to alter the acoustic and frequency response of a 
speaker system is through the careful control of the basic EQ and system setting 
Controls that are found on most actively powered speaker systems (Figure 16.2). 
These simple Controls allow the user to roughly match level and EQ settings 
to best fit their application or placement layout. Often these settings can be 
used to: 






Monitoring IAPTER16 



FIGURE 16.2 

Rear Controls for the 
Mackie HR824mk2 
active monitor 
speaker. (Courtesy of 
Loud Technologies, 
Inc., www.mackie. 
com.) 


■ Finely match audio balance levels within a stereo and surround system. 

■ Allow for basic high- and low-end tuning. 

■ Compénsate for bass buildup (whenever speakers are placed in or near a 
córner or other large boundary). 

■ Offer various speaker "emulation" modes. 

Larger, passive monitors (often a farfield pair) can be tuned by placing a 1/3- 
octave bandwidth graphic equalizer between each of the console's control room 
monitor outputs and the power amplifier. Of course, there are various ways to 
fine-tune a speaker system and room response to improve a studio's overall 
monitoring conditions. The simplest approach is to place a high-quality omni- 
directional mic at the center listening position and inserí it into a channel strip 
on your DAW. By recording a loop of white noise (or another noise color— 
search Wikipedia for "colors of noise") and playing it back equally to each 
speaker in the system, basic level matching and frequency measurements can 
be taken. 

It should be noted that the above measurement method is often inaccurate (due 
to time-based variables and other acoustical factors). Fortunately, stand-alone 
software can accurately measure such variables as level, EQ and time delay 
reflections, to help with the accurate tuning within a control room, studio, 
performance hall or auditorium. Note also that these measurements are often 
best interpreted by those who are well versed in acoustics, studio design and 
the fine art of common sense ... in other words, careful fine-tuning "might" be 
best leff to a competent professional. 

SPEAKER DESIGN 

Just as the sound of a speaker system will vary when heard in different acoustic 
environments, speakers of different design and operating type will usually 
sound very different from one another, even when heard in the same room. 
Enclosure size, number of components and driver size, crossover frequencies 
and design philosophy contribute greatly to these differences in sound quality. 

Usually professional speaker enclosures are one of two design types: air suspen¬ 
sión and bass reflex. An air-suspension speaker enclosure (Figure 16.3a) is an 
airtight system that seáis the air in its interior from the outside environment. 
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This system type (which is often used in "book- 
shelf" designs) generally provides a strong, 
"tight" bass response, while often being rolled 
off at the extreme low end. The bass-reflex or 
vented-box design (Figure 16.3b), uses a tuned 
bass porthole that's designed into the front or 
rear of the speaker enclosure. This allows the air 
mass inside the enclosure to mix freely with the 
outside air in such a way as to act as a tuned 
resonator (which serves to acoustically boost the 
speakers output at the extreme lower octaves). 


(a) 

FIGURE 16.3 

Speaker enclosure 
designs: (a) air 
suspensión; (b) bass 
reflex. 


Crossover networks 

Because individual speaker elements (drivers) 
are more efficient in some frequency ranges than others, different driver sizes 
and types are often used in combination to give the desired frequency response 
and level output. For example, large-diameter bass drivers (such as 10- and 
12-inch units) produce low-frequency information more efficiently than at high 
frequencies; medium-sized speakers (such as 4- and 5-inch units) opérate best 
in the midrange frequencies; and small speakers (such as y 2 - to l)^-inch dia- 
phragm sizes) reproduce highs more effectively. 


These speakers are usually connected to a series of passive crossover networks, 
which are used to divide the frequency spectrum into two or more bands. These 
bands split the frequency range in such a way that the speakers are driven in 
the most efficient manner (i.e., signáis below the crossover frequency are routed 
to the bass driver or drivers, while mid and/or high frequencies are routed to 
the high drivers, as shown in Figure 16.4). 


If a speaker system has only one crossover frequency, it is called a two-way 
system, because the signáis are divided into two frequency bands. Likewise, if 
the signal has two crossover frequencies, it's called a three-way system. The 
Westlake Audio BBSM-15 monitor speaker (Figure 16.5), for example, is a 
ported three-way system that ineludes two 10-inch woofers for the bass, a 6.5- 
inch driver for the midrange frequencies, and a 1.5-inch diaphragm dome 
tweeter for the highs with the crossover frequencies being tuned at 600 Hz and 
4 kHz, respectively. 


Actively powered vs. passive speaker design 

As you might expect, many of the more popular monitor types that are in use 
today incorpórate an actively powered amplifier into their design. These cost- 
effective systems have become widely accepted by the professionals and project 
communities due to their: 

■ Compact design 

■ High-quality sound (often these systems are bi- or tri-amplified) 
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(a) 



(b) 


FIGURE 16.4 

Example of a 
passive two-way 
crossover system: 

(a) crossover/amp 
layout; (b) frequency 
response curves 
showing crossover 
frequencies of 
1500 Hz. 



FIGURE 16.5 

Westlake BBSM-15 
Reference Series 
monitor. (Courtesy 
of Westlake Audio, 
www.westlakeaudio. 
com.) 
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FIGURE 16.6 

Example of an active 
two-way active 
crossover system. 
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■ Expandability (additional speakers can be cost effectively added for sur- 
round-sound monitoring) 

■ Lack of a need for an external power amplifier. 

For these reasons, these systems are often ideal for project- and DAW-based 
facilities and are steadily increasing in popularity. 

Electronic crossover networks (Figure 16.6), called active crossovers, differ from 
conventional passive crossover systems in that the line-level audio signal is split 
into various frequency bands that use more complex analog and digital circuitry. 
Each equalized line is fed to its own power amp, which in turn is used to drive 
the respective bass, mid-, and/or high-driver elements. Such a system is generally 
referred to as being bi-amplified or tri-amplified, depending on the number of 
crossovers and power amps that are required per channel. These systems have 
several advantages: 

■ The crossover signáis are low in level, meaning that inductors (which can 
introduce audible ringing and intermodulation distortion) can be elimi- 
nated from the design. 

■ Power losses (due to the inductive resistance within the passive crossover 
network) can be eliminated. 

■ Each frequency range has its own power amp, so the full power of each 
amplifier in the respective speaker efficiency range will be available 
(meaning that the drawing of excessive current in one range won't affect 
the sound in another frequency range). 

Because there are so many variables in speaker and amplifier design, it quickly 
becomes clear that there's no such thing as the "ideal" monitor system. The final 
choice is often more of a matter of personal taste and current marketing trends 
than one of subjective measurements. Monitors that are widely favo red over a 
long period of time tend to become regarded as the industry standard; however, 
this can easily change as preferences vary. Again, the best judge of what works 
best for you should be your own ears and personal sense of style. 
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SPEAKER POLARITY 

A common oversight that can drastically affect the sound of a passive multi- 
speaker system is to wire them out-of-phase with respect to each other. Speaker 
polarity is said to be electrically in-phase (Figure 16.7a) whenever one signal 
that's equally applied to both speakers causes their cones to move in the same 
direction (either positively or negatively). When the speakers are wired out-of- 
phase (Figure 16.7b), one speaker cone will move in one direction while the 
other will move in the opposite direction. 

Speaker polarities can be easily tested by applying a mono signal to both or all 
of the speakers at the same level. If the signal's image appears to originate from 
a localized point directly between the speakers, they have been properly wired 
(in-phase). If the image is hard to lócate and appears to originate beyond the 
outer boundaries of a stereo speaker pair, or shifts as the listener moves his or 
her head, it's a good bet that the speakers have been improperly wired (out-of- 
phase). 

An out-of-phase speaker condition can be easily corrected by checking the 
speaker wire polarities. On a passive speaker design, the "hot" lead (+ or red 
post) leading from each amp channel should be secured to the same lead on 
its respective speaker (Figure 16.8). Likewise, the negative lead (- or black post) 
should be connected to its respective lead for each speaker in the system. 

Speaker cable is generally color coded, with white or red being positive (+) and 
black being negative (-). If no color coding is present, heavy-duty power cable 
or other cabling types that are suitable for speakers can be used. These cables 








FIGURE 16.7 

Relative in-phase 
and out-of-phase 
cone motions: 

(a) in-phase; 

(b) out-of-phase. 
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FIGURE 16.8 

Color-coded banana 
plug binding posts 
are often used to 
help ensure a proper 
speaker connection. 



will often have a notched ridge (or set of ridges) or have a printed white band 
that's generally connected to the negative lead post. 

Passive speaker wire gauges should always be as heavy duty as is possible or 
practical. The #18 wire is considered to be the minimum for lengths of less than 
25 to 50 feet, while #14 is considered the minimal length that should be used 
for 50- and 100-foot runs. [Note: The smaller the gauge number, the thicker the 
wire; therefore, #14 is thicker than #18.) Two reasons for increasing the thick- 
ness of the conductor as cable length increases are: 

■ All cable has resistance, which will increase with length. Thinner cables 
generally have greater resistance valúes, meaning that more power will be 
dissipated in the cable and will therefore be unavailable to drive the 
speaker. 

■ The higher the cable resistance, the lower the amplifier's effective damping 
factor. Damping factor is related to how well the amplifier is able to 
control the motion of the speaker cone. A lowered damping will often 
result in a loss of tightness, definition, and clarity in the low end. Again, 
thicker conductors will have less resistance and thus help to minimize 
damping problems. 

MONITORING 

When mixing, it's important that the engineer be seated as closely as possible 
to the center of the soundfield (making allowances for the producer, musicians 
and others who are also doing their best to be in the "sweet spot") and that all 
the speaker volumes are adjusted equally. For example, if the engineer is closer 
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to one speaker than another, that speaker will sound louder and the engineer 
may be tempted either to pan the instruments toward the far speaker or boost 
that entire side of the mix to equalize the volumes. The resulting mix would 
sound properly centered when played in that room, but in another environ- 
ment, the mix might be off-center. As a quick check against this, the engineer 
should always make sure that an audible volume difference between speakers 
is accompanied by a corresponding visual difference on the main output VU or 
display meters. Another guard against off-center levels is to monitor noise or a 
test tone signal from each speaker in the soundfield to check that they're equally 
loud (either by doing a quick audible check or by placing an SPL meter or mic 
in the center listening position and checking the relative levels). 

Here are a few additional pointers that can help you get the best sound from 
your control room monitors: 

■ Make sure that the room's reverb time is both low and smooth over the 
audible range (absorptive and diffusive materials can help). 

■ Keep large reflections to a minimum within the room (again, absorptive 
and diffusive materials can help reduce reflections to a level that's at least 
20 dB down from the direct signal). 

■ Keep all room boundaries and reflections as symmetrical as possible along 
the L/R and front/back axis of the mixing soundfield. 

■ If diffusers are used, place them at the rear part of the room. 

■ If the speakers are mounted in soffits, make sure the front wall has a hard, 
smooth surface and that the speakers are acoustically isolated from the 
surface (as much as possible; see Chapter 3 for more info). 

■ Angle the monitors symmetrically toward the listening position in both 
the horizontal and vertical planes. 

■ If nearfield monitors are used, place them on medium-density foam 
blocks to reduce consolé- and desk-borne vibrations. 

Monitor volume 

Before continuing, I'd like to revisit another important factor—volume. During 
the record and mixdown stage, it's important to keep in mind that the Fletcher- 
Munson curves will always have a direct effect on the frequency balance of a 
mix. Because our ears perceive recorded sound differently at various monitoring 
levels, our ears will easily perceive the extreme high and low frequencies in the 
mix when monitoring at loud levels (sounds good doesn't it?). However, when 
the mix is played back at lower levels (such as over the radio, TV or Computer), 
our ears will be much less sensitive to these frequencies and the bass and 
extreme highs will probably be deficient (leaving the mix sounding distant and 
lifeless). 

LJnlike during the 1970s, when excruciatingly high SPLs tended to be the rule 
in most studios, recent decades have seen the reduction of monitor levels to a 
more modérate 75 to 90 dB SPL. (A good rule of thumb is that if you have to 
shout to communicate in a room, you're probably monitoring too loud.) These 
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modérate levels offer a good compromise for mixing, as they more accurately 
represent listening levels that are likely to be encountered in the average home 
(i.e., the Fletcher-Munson curves will be more closely matched). Ear fatigue and 
potential ear damage due to prolonged exposure to high SPLs by industry pro- 
fessionals can also be avoided at these levels. For more information on safe 
monitor levels and hearing conservation, contact the House Ear Institute at 
www.hei.org. 

The big bad bottom end 

Before we continué, it's important that we spend a bit of time talking about the 
bottom end of the audible spectrum. ... That's to say, the big-bang-boom of 
the lows! 

THELFE 

The LFE channel stands for low frequency effect and was developed by the film 
industry to give an extra "ummmffff" in the low-bass end to add an extra degree 
of impact... especially during explosions and low rumbles. Contrary to popular 
misconceptions, the ".1" aspect of an LFE has little or nothing to do with the 
musical bass line of a music program or film score. Instead, it was originally 
designed as a band-limited channel (120 Hz for Dolby Digital or 80 Hz for 
DTS) for adding 10 dB of headroom to a film. (I recall that it was first used in 
the film Earthquake to add extra realism to the earth-shaking score.) 

There is an ongoing debate among top music mixers as to whether to use the 
LFE channel at all in music mixing. Almost everyone agrees that all of the speak- 
ers within a music mixing scenario should be full range ... however, there are 
those who don't believe in using the LFE channel at all, while others will put 
some degree of low content in this channel (stating that customers feel cheated 
out of their well-earned .1 when they don't get some bottom in that channel). 
As an added note: Critical musical material should never be sent to the LFE 
channel, because Dolby decoders will completely drop it when folding 5.1 
material down to stereo. 

THESUB 

A subwoofer, in its truest sense, is an additional low-end driver that's added to 
a system for the purpose of assisting with low-end bass reproduction ... and, 
hopefully, for improving low-end definition. Although many expect a "sub" to 
add a VERY BIG BOOM to the low end, this use will almost always end up as 
an overly accentuated "big bottom" that could easily lead to a false response in 
the low end. The improper overuse of a sub could actually lead to a final master 
that's bass shy, due to excessive bass levels. 

The proper use and setup of a subwoofer can help extend the low end, while 
adding a tight definition to the bass that can actually bring a monitor playback 
system to life. As you might expect, the response and overall sound of a sub 
is often heavily influenced by the acoustics of the playback environment. In 
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addition to using a software frequency and acoustics analyzer (that's capable of 
displaying phase relationships as well as frequency response). One of the best 
ways to set up and tune a sub is by experimentation. When placing the sub- 
woofer, the following concepts should be kept in mind: 

■ Because low-frequency audio is basically nondirectional, some feel that a 
sub can be placed in any spot in the front of the room. However, a cen- 
trally placed position is most likely preferable, as a difference can be heard 
with upper-bass definition and image localization. 

■ Although a sub's output will be greater when placed near a wall boundary, 
a "muddy" or "boomy" sound might exist due to unfavorable room and 
córner reflections. 

■ Most active subs will inelude level and crossover frequency adjustments. 
These settings should be used to closely match your sub to the chosen 
speaker set or sets. 

■ Active subs will also inelude a phase switch that can match the phase 
(driver motion) to your main speaker set. Using an analyzer or your ears, 
set the phase to a position that sounds best. 

All of this leads to the fact that sub placement and setup are critical in getting 
the best sound possible. Simply put, a high-quality, properly adjusted sub can 
add life and definition to your bottom end that's a beauty to behold. 

BA55 MANAGEMENT 

At first glance, bass management might look like an LFE channel or a simple 
subwoofer system (which is a good reason why bottom-end issues can get so 
confusing). In reality, it is entirely different in that a bass management system 
(which is used in most home theatre Systems and high-end auto Systems) use 
filters to extract low-frequency information from the main channels and then 
routes the bass to the bass speaker, while the highs are sent to the system tweet- 
ers. In short, this method has the advantage of allowing for one bass speaker 
and smaller, easier-to-build múltiple (stereo or surround) tweeters that can be 
placed in confined spaces, and helps reduce intermodulation distortion. 

Although these Systems are in wide use in home playback and theater systems, 
it's widely held that these speaker systems aren't suitable for studio playback 
due to irregularities in midrange response, image localization and overall sound. 

Monitoring configurations 

In addition to getting the best overall sound out of a mix, another monitoring 
concern that has gained importance is the need to tailor the mix to the intended 
room/speaker configuration (i.e., mono-stereo, mono-surround and stereo- 
surround). 

It's important to remember that a large percentage of your potential customers 
may first hear your mix over a TV, Computer or AM/FM radio in mono. There- 
fore, if a recording sounds good in stereo but poor in mono, it might not sell 
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as well because it failed to take these media into account. The same might go 
for a surround-sound mix of a music video or feature release film in which 
proper attention wasn't paid to phase cancellation problems in mono and/or 
stereo (or vice versa). The moral of this story is simply this: 


To prevent potential problems, a mix should be carefully included that would cancel out Instruments and poten- 

checked in all its release formats in order to ensure that it tially degrade the balance, 

sounds good and that no out-of-phase components are 


The most commonly accepted speaker configurations are mono, stereo and 
surround sound. 

Mono 

Even in this day and age, much of the buying public will first experience a mix 
in monaural (mono) sound (Figure 16.9). That's to say, they'll hear your song 
over the radio, on TV, in an elevator, on the Computer, etc. For this reason record 
companies, producers and everyone else involved in the process will often place 
a great deal of importance on mono compatibility and the overall sound of a 
mono mix. In fact, it's not uncommon for a sepárate mono mix to be made to 
ensure that it'll sound as good as it can for its intended médium. 


FIGURE 16.9 

Example of a 
monaural monitoring 
configuration. 


Ever since the practical development of the 45°/45 o record cutting process, 

stereophonic (stereo) sound (Figure 16.10) has ruled 
the turntable. Of course, over the years, stereo has 
also grown to rule FM radio, the CD player, TV and the 
Internet. For these reasons, the creation of a quality 
stereo mix is extremely important with relation to F/R 
balance, overall frequency balance, dynamics, depth 
and effects. 

It has been stated throughout this book that the mixing 
environment should be acoustically and physically sym- 
metrical (within reason) in order to ensure that the F/R 
balance, effects balance and overall imagery are accurate 
within the stereo soundfield. Beyond this, it's always 
wise to check for mono compatibility. Phase cancella- 
tions can cause instruments or frequencies in the spec- 
trum to simply disappear whenever a mix is summed to 
mono. The best tools for reducing phase errors are good 
mic technique, a phase plug-in or display and, of course, 
your ears. 
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Stereo + sub 

In actuality, 2.1 is a misnomer. Although it's a relatively 
easy title to remember, 2.1 usually represents the addi- 
tion of a powered sub to a set of stereo speakers—not 
a .1 LFE channel. Instead of providing an additional 
sonic big-bang-boom, the subwoofer is used to extend 
and to help define the music's low end. Extreme care 
should be taken when setting up such a system, as 
a loud, improperly setup sub and crossover combina- 
tion can create a low end that can be boomy and very 
inaccurate. 

Although it's not commonly known, there are two ways 
to set up a 2.1 system. Through careful adjustment and 
care, either method can work: 

■ Connect the stereo signal to the sub inputs and 
use the sub's internal crossover outputs to feed a 
bass-limited signal to the main stereo speakers 
(Figure 16.11). This traditional system will band- 
limit the stereo speakers low end and should be 
tuned carefully. 

■ The active sub (which usually has two inputs) can 
be combined with the stereo signal by using a 
simple "y" connection or summing network. This 
method will NOT band-limit the stereo speakers 
low end and should be tuned with VERY special 
care, because interference and interactions between 
the bass drivers can definitely cause inaccuracies. 

4 0 Quad 

During the 1960s and 1970s, quad was used to intro¬ 
duce music enthusiasts to the joys of surround. Although 
5.1 has dominated the field in recent decades, the use 
of a four-channel (left/right/rear-left/rear-right) sound- 
field is still relevant as a surround recording, playback 
and live performance tool (Figure 16.12). 

Surround minus an LFE 



FIGURE 16.10 

Example of a 
stereo monitoring 
configuration. 



FIGURE 16.11 


Simply put, 5.0 is 5.1 without an LFE big-bang-boom channel. Since LFE is 
band-limited to the extreme bass frequencies and is intended to provide an 
added EFFECTÜÜ, many surround professionals argüe that it's unnecessary 
to add the LFE channel to a surround music project at all. As with much 
of recording, surround is an open frontier that's totally open to personal 
interpretation. 


Example of a 2.1 
stereo monitoring 
configuration, 
showing the 
crossover 

connections through 
the sub. 


















Monitoring 



FIGURE 16.12 

Example of a quad 
monitoring configura¬ 
ron, showing a width 
placement range in 
the rear channels. 


■ Those who advócate that music should be focused in 
the front L/C/R mix, while placing the ambient and 
special effects into the rear surround field 
■ Those who prefer to leave such traditional concepts behind and look at 
the soundfield as a full 360° environment, where instruments and mix 
elements can fall anywhere in the surround soundfield. 


Surround with an LFE 

With the advent of 5.1 playback in home and audio 
"theaters," surround sound (Figure 16.13) has grown 
into a major professional and consumer entertainment 
market. The ñame refers to the five full-range channels 
that offer up a left/right/center/LFE/rear-left/rear-right 
soundfield. DVD videos dises commonly use 5.1 encod- 
ing in the form of Dolby Digital (a scheme that encodes 
the discrete 5.1 information into a single bitstream, 
known as AC-3) as well as the DTS format. 

The debate over how to mix for surround has raised the 
temperature of many a panel forum and Web chat dis- 
cussion. Those who dive into these discussions tend to 
fall into two camps: 


FIGURE 16.13 

Example of a 5.1 
surround-sound 
monitoring configura¬ 
ron, showing a width 
placement range in 
the surround 
channels. 
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When dealing with discrete 5.1, compatibility issues aren't usually a major 
problem, because it's understood that media will be played back on a discrete 
playback system. In such a situation, it's common for a sepárate stereo/mono- 
compatible mix to be built up from the soundtrack or production mix. However, 
should the mix be encoded in Dolby ProLogic (a scheme that encodes the sur- 
round information into a L/R stereo track using complex phase relationships), 
care should be taken to ensure surround, stereo and mono playback compatibil¬ 
ity. Additional and detailed information can be found in Chapter 17. 

7 Theater plus an LFE 

Within large-scale theater soundtracks, the eight-channel 7.1 format is often 
used to provide two extra (center left and center right) speakers for additional 
spatial placement and localization. 

Monitor level control 

As monitor systems grow to accommodate various production and playback 
formats, controlling the monitor level and switching can become problematic. 
Many multibus consoles and high-end DAW systems are capable of handling 
the monitoring requirements of the various formats (including surround 
sound); however, even these can fall short when múltiple sources, level trims 
and straightforward level control are taken into consideration (although newer 
interface designs are finally offering multichannel level control). For this reason, 
many have turned to using a high-quality preamp, interface or dedicated studio 
monitor management system (Figure 16.14) in order to handle the monitoring 
needs of a professional or project studio. 
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FIGURE 16.14 

Motu 828 Mk3 
surround audio 
interface control 
panel. (Courtesy of 
MOTU, Inc., www. 
motu.com.) 
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Spectral reference 

Even if your monitor speakers have perfect time/frequency response readings, 
few people who buy recordings will have a perfectly accurate reproduction 
system (if there is such a thing). As a result, they won't hear the exact mix that 
was heard in the control room. The buying public will often hear different 
frequency balances, due to response variances between the almost limitless 
types of speaker/listening room combinations. Faced with this fact, the best we 
can do as professionals is to rely on our judgment, our experience and our ears 
to create a balanced mix that'll do the best possible justice to a project under a 
wide range of listening conditions. 


FIGURE 16.15 

Software spectral 
display. (Courtesy 
of Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 
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In addition to our best set of tools—our experience, judgment and ears—a 
visual tool known as a spectral analyzer is often used to give visual cues as to an 
audio program's overall frequency balance at any point in time. These applica- 
tions (which can be either stand-alone or found in a DAW program or suite of 
plug-ins) give a visual bar readout of a signal's level at various frequencies 
throughout the audible band (Figure 16.15). Obviously, such a tool can help 
an engineer or producer to zero in on an offending or deficient frequency and/ 
or bandwidth simply by looking at the display over time. During both the record 
and mix phases, these tools can help point out and avoid potential phase and 
spectral problems. 


MONITOR SPEAKER TYPES 

To obtain the best possible compromise in sound 
balance, several alternative monitor speaker 
options are often available as a reference during 
a session and/or mix. Quite often, a consolé or 
monitor control system will letyou select between 
speaker/monitor types, with each set commonly 
having its own associated amplifier for power 
and level matching flexibility. These types inelude 
farfield, nearfield, small-speaker and headphone 
monitoring. 

Farfield monitoring 

Farfield monitors often involve large, multidriver 
loudspeakers that are capable of delivering 
relatively accurate sound at modérate to high 
volume levels. Because of their large size and 
basic design, the enclosures are generally soffit 
mounted (built into the control room wall to 
reduce reflections around and behind the enclo- 
sure and to increase overall speaker efficiency). 
This monitor type has lost popularity to the 
easier-to-use and less expensive nearfield book- 
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FIGURE 16.16 

Tannoy System 215 
DMTII main reference 
monitor. (Courtesy of 
Tannoy/TGI North 
America, Inc., www. 
Tannoy.com.) 


shelf system. Further reading on soffit design and construction can be found in 
Chapter 3. 

These large-driver systems (Figures 16.16) are often used during the recording 
phase because of their ability to safely handle high sound levels (which can 
come in handy should a microphone drop or a vocalist decide to be cute and 
scream into an open mic). They're also great for listening to a mix at loud levels 
in order to hear the impact that it'll have on the dance floor or in a souped-up 
car system. In fact, certain types of music rely on bass levels that can only be 
supplied by such a system at moderate-to-high SPLs. [Note: It's important to be 
aware of the danger of long-term exposure to such sound levels.) 

Nearfield monitoring 

Although farfield monitors are generally the best reference at high listening 
levels, few systems are equipped with speakers that can deliver "clean" sound 
at such high SPLs. For this reason, most professional and project studios use 
nearfield monitors that more realistically represent the type of listening environ- 
ment that John and Jill Q. Public will most likely have. 
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FIGURE 16.17 

Yamaha HS-80M 
studio monitor 
speakers. (Courtesy 
of Yamaha Corpora¬ 
tion of America, 
www.yamaha.com.) 


The term nearfield refers to the placement of small to medium-sized bookshelf 
speakers on each side of a desktop working environment or on (or slightly 
behind) the metering bridge of a production consolé. These speakers (Figures 
16.17 through 16.19) are generally placed at closer working distances, allowing 
us to hear more of the direct sound and less of the room's overall acoustics. 

In recent times, nearfields have become an accepted standard for monitoring in 
almost all areas that relate to audio production for the following reasons: 

■ Quality nearfield monitors more accurately represent the 
sound that would be reproduced by the average home speaker 
system. 

■ The placement of these speakers at a position closer to the lis- 
tening position reduces unwanted room reflections and reso- 
nances. In the case of an untuned room, this helps to create a 
more accurate monitoring environment. 

■ These moderate-sized speaker systems cost significantly less 
than their larger studio reference counterparts (not to mention 
the reduced amplifier cost because less wattage is needed). 

As with any type of speaker system, nearfields vary widely in both 
construction and fundamental design philosophy. It almost goes 
without saying that care should be taken when choosing the 
speaker system that best fits your production needs and personal 
tastes. 



FIGURE 16.18 

Mackie HR824mk2 
active monitor 
speaker. (Courtesy of 
Loud Technologies, 
Inc., www.mackie. 
com.) 
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FIGURE 16.19 

Digidesign/PMC RM1 
and RM2 powered 
studio monitors. 
(Courtesy of 
Digidesign, a división 
of Avid, www. 
digidesign.com.) 


Small speakers 

Because radio, televisión and Web airplay are major forces in audio production 
and help in the distribution and sales of recordings, it's often good to monitor 
your final mix through a small, inexpensive speaker set that mimics the non- 
linearities, distortion and poor bass response of those media. Such speakers can 
be either bought or easily made. Occasionally, these small speakers are incor- 
porated into consolé and two-track ATR designs for easy monitoring. 

Before listening to a mix over such small speakers (Figure 16.20), it's often a 
good idea to take a break in order to allow your ears and your brain to recover 
from the prolonged exposure of listening to higher sound levels over larger 
speakers. 

Headphones 

Headphones (Figure 16.21) are also an important monitoring tool, as they 
remove you from the room's acoustic environment. Headphones also offer 
excellent spatial positioning in that they let the artist, engineer or producer place 
a sound source at critical positions within the stereo field without reflections 
or other environmental interference from the room. Because they're portable, 
you can take your favorite headphones with you to quickly and easily check out 
a mix in an unfamiliar environment. 
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FIGURE 16.20 

AVANTONE MixCubes 
Full-Range Mini 
Reference Monitors. 
(Courtesy of Avant 
Electronics, www. 
avantelectronics. 
com.) 



FIGURE 16.21 

Sony MDR-7506 
professional dynamic 
stereo headphones. 
(Courtesy of Sony 
Electronics, Inc., 
www.sony.com/ 
proaudio.) 



It should be noted that while headphones elimínate the acoustics of a room 
from the monitoring situation, they don't always give a true representation of 
how sounds will behave through loudspeakers (especially with regard to 
imaging). Monitoring through headphones will also often emphasize low-level 
sounds like reverb and other effects more than loudspeakers in a room. As a 
result, listening to a mix over both monitor types will often be beneficial. 
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Yourcar 

Last, but not least, your (or any other) car can be a big help in determining how 
a mix will sound in one of modern society's most popular listening environ- 
ments. You might take your mix out for a spin on a basic car system, as well as 
a souped-up, window-shakin' bass bomb. 

MONITORING IN THE STUDIO 

In addition to the need for an accurate monitoring environment in the control 
or production studio, musicians often have special needs for playing back or 
monitoring the sound that's being recorded during a session. 

Headphones in the studio 

Monitoring over headphones in the studio is by far the most common way to 
monitor sound during a recording session. 

When recording, it's generally best to use sealed headphones to prevent or 
minimize the monitor feed from leaking back into the newly recorded track. 
The number of headphones will vary with the particular session requirements. 
An orchestral film overdub, for example, could easily use upward to 60 pair 
during a single session and then, might only need one pair on the next overdub. 

Such variations also place demands on the distribution of headphone power 
and the number of required feeds. As you might imagine, the power that would 
be needed to run 60 headphones is quite considerable, requiring that a power 
amplifier be used to drive any number of headphone distribution boxes through- 
out the room. On the other hand, the power that would be required to drive 
one or two headphones in a project studio might be so small that they could 
be driven from the console/mixer's internal headphone or a basic headphone 
distribution amp (Figure 16.22). 

Likewise, the need for sepárate monitor mix feeds will vary from studio to 
studio, as well as with varying session requirements. For example, a straightfor- 
ward session involving only a few musicians will most likely require a single 
headphone monitor mix. During a tracking or a more complicated session, two 
or more sepárate mixes could be made available to the musicians by sending 
sepárate "cue" feeds from the mix outputs of múltiple auxiliary sends. 

For those who wish to allow the musicians to create their own mix, headphone 
Systems (Figure 16.23) exist that can draw sepárate instrument or submix feeds 



FIGURE 16.22 

Powerplay Pro-8 
8-channel headphone 
distribution amp. 
(Courtesy of 
Behringer Int’l GmbH, 
www.behringer.com.) 
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FIGURE 16.23 

HRM-16 16 channel 
personal headphone 
mixing station. 
(Courtesy of Furman 
Sound, Inc., www. 
furmansound.com.) 



from various bus or cue feed outputs. This gives the musician complete control 
over volume, pan, and instrument mix within his or her headphones. 

Speakers in the studio 

Often, there's not enough time for the musicians to leave their instruments and 
walk into the studio to hear a playback. For this reason, studio monitors might 
be mounted in the studio for immediate playback. These are usually larger 
monitors, because they will need to be driven to levels that can adequately fill 
the studio and will often need to withstand unintentional abuse (from feed- 
back, too loud a level, etc.). 


CHAPTER17 


Surround Sound 


At a special media event that was attended by about 500 people, my good friend 
George Massenburg asked the crowd, "How many of you truly know the joys 
of surround-sound playback or production?" In that crowd, about 20 people 
(including myself) raised their hands. He then went about the business of 
playing back several world-class projects that most of the crowd was familiar 
with, and talking about the benefits of surround-sound music and audio-for- 
visual production. Right-on, George! 

Truth is, whether you're an advócate or an adversary of the concept of surround 
sound, one thing is for sure ... it exists in the here and now, and it is certain to 
play an ever-growing role in the media technologies of tomorrow. 

I think that at this point I have to break from my role as a neutral author and 
State fíat out that I'm a HUGE surround proponent. For me, the ability to 
compose and mix in surround has been an uplifting and hugely beneficial 
experience. I clearly remember as a kid, placing two álbum covers behind my 
ears and listening as the music carne to life around my head (go ahead, try it)! 
The ability to augment music and visual media by placing sounds within a 360 ° 
circle has literally opened up new dimensions in mixing and effects-placement 
technologies in a way that keeps me creatively young. 

Most of the people that I know who are ideologically closed to the idea of sur- 
round haven't worked with or listened to the médium. I urge that you keep an 
open mind to the process, watch movies and listen to music in surround (in 
any available format) and, if at all possible, take the time and effort to familiar- 
ize yourself with the process of producing sound in surround. 

Before I put my neutral writer's cap back on, I'd like to present the strongest 
argument for becoming familiar with the production techniques of surround: 
enhanced job opportunities. I have friends living in the technological heart 
of numerous city centers who are completely unfamiliar with any and all forms 
of surround. It never occurred to them that they could increase their opportu¬ 
nities, client base and perceived prestige in the fields of mixing music, 
soundtracks for movies and gaming ... by investing in a surround monitoring 
system and learning the basic tools and techniques of mixing and mastering 
media for surround. If for absolutely no other reason, the ability to understand 



Surround Sound: Past to the Present 


and work in new and upcoming technologies can help give your career a 
marketing edge. 


SURROUND SOUND: PAST TO THE PRESENT 

Of course, it all started with the movies. In the pre "talky" days movie theaters 
were anything but silent ... organs, chimes and all sorts of percussion clanged 
from all parts of the room behind órnate wall features. With the introduction 
of movie sound and the use of the musical seo re in the late 1920s, all of the 
soundtracks were played back by the delivery format of the time—mono. 

On November 13, 1940, Walt Disney's Fantasía opened up the soundfield to 
stereo when it premiered at New York's Broadway Theater. Although it wasn't 
the first film to be recorded using the "múltiple channel recording" process (that 
distinction went to the Deanna Durbin film One Hundred Men and a Girl , which 
was ultimately released by Universal in mono), Fantasía was the first to intro¬ 
duce multichannel sound to the public. 


The final mix of Fantasía was printed onto four master optical tracks for play- 
back using a special RCA system called "Fantasound." (Unlike the two-channel 
format that was adopted for home playback, film "stereo" sound started 
out with, and continúes to use, a minimum of four discrete channels.) This 
multispeaker setup placed three horns behind the screen and 65 smaller speak- 
ers around the walls of the theater. Due to the outlandish setup costs (estimated 
at about $85,000 for each theater at the time), RCA stopped making this 
fantastic system after setting another one up at the Carthay Circle Theater in 
Los Angeles. 


FIGURE 17.1 

Example of a 
Quadraphonic (Quad) 
speaker setup. 



In the early 1950s, the first commercially successful 
multichannel sound formats carne onto the scene 
with the development of Cinemascope (four-track 
35 mm) and Todd-AO (six-track 70 mm). Both of 
these formats made use of magnetic tracks that 
existed alongside the release print picture, and 
required that the projector be fitted with special 
playback heads, amps and speakers. 

In the early 1970s, the home consumer stereo 
market was gaining in popularity and audio 
quality. With the development of higher quality 
amps, speakers and record turntables carne new 
experimentations in systems design that eventu- 
ally led to the development of Quadraphonic 
Sound (Quad). This playback system made use 
of four speakers (Figure 17.1) that were placed 
in the four corners of a room, which enveloped 
the home listener in a L/R/LS/RS listening 
experience. 
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Although analog reel-to-reel and cassette tape machines were used in homes, 
they were still relatively expensive. By elimination, this meant that playback 
would have to be carried out by the most popular médium of the day—the LP 
record. 

Reproducing four channels from a record wasn't easy. Often, the task of encod- 
ing four channels onto the two walls of a vinyl record was done with relative 
phase or by using a complex, high-frequency carrier tone that was used to 
modulate the sum and difference channels. However, the real difficulties lay in 
the wide assortment of incompatible encode/decode formats that were offered 
by various manufacturers. Given the fact that your system might not play back 
the latest and greatest release, and that dises were both expensive and prone to 
deterioration over a short period of time (the high-frequency signáis on modu- 
lated records would literally wear away) ... the Quad revolution died away. 

Stereo comes to televisión 

Since its inception, surround sound had been used in motion picture soundtrack 
production with great success. With the introduction of Dolby noise reduction 
and multichannel audio in the theater, good sound was not only appreciated, 
it was expected! On the other side of the media tracks, televisión sound was 
strictly a lo-fi, mono experience up until the early 80s. Sound was strictly an 
afterthought to the visual image. However, with the adoption of the video cas¬ 
sette recorder (VCR) and later hi-fi stereo sound from a VCR, discriminating 
audiences began to appreciate the higher-quality sound that accompanied the 
almighty image. With the dawning of the music video (I want my MTV!), stereo 
broadeast televisión and the stereo VCR, TV was finally forced into an era of the 
higher-quality, multichannel, visual experience. 

Theaters hit home 

In 1982, Dolby Labs introduced "Dolby Surround," an extensión of their profes- 
sional Dolby Film Sound Project. By 1987, millions of homes were beginning 
to be fitted with consumer receivers and high-end audio Systems that were 
integrated with video. With the introduction of Dolby Pro Logic, a simple 
system was put into place that allowed phase information to be extracted from 
the two tracks of a stereo program to reproduce the L/R/C/Surround field. In 
1992, with the introduction of Dolby's AC3 surround encode/decode system 
(Dolby Digital), it became possible for discrete 5.1 surround sound to be 
encoded directly with the new visual entertainment médium of the early 21st 
century—the DVD! 

SURROUND IN THE NOT-TOO-DISTANT FUTURE 

With the recent introduction of the Acura TL car with an Elliott Sheiner (ELS) 
surround-sound DVD-A system, many in the audio world await the open accep- 
tance of surround sound into the domain of the open highways. In fact, many 
feel that the automobile, when joined with the home theater system, will greatly 
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help to propel surround into the open market mainstream. Meanwhile, video 
games and audio codees (most notably WMA9 and MP4) allow for surround- 
sound music to be encoded and distributed over the Web for personal use with 
the millions of surround Systems that are now installed on home personal 
computers. 


MONITOR NG IN 5.1 SURROUND 

With the recent proliferation of surround-sound speaker options for both the 
home and studio, options exist at all levels of quality, functionality and cost 
effectiveness for installing a surround monitoring environment. As with most 
new technologies, it's important that your existing facility be taken into account, 
so as to maximize control over monitor levels and monitor format choices 
(discrete surround, stereo and mono), as well as its integration with your current 
consolé and/or DAW system. Befo re choosing a 5.1 speaker system/setup, it 
would be wise to consider the following: 


FIGURE 17.2 

Nuendo’s Control 
Room Mixer in 
surround mode. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
www.steinberg.net.) 



What are the commercial advantages to producing audio in surround?— 
Are there any new clients or business ventures that can make use of this 
technology? 

What is the budget for such a system? 

Can your existing speakers be integrated into the surround system?—Many 
powered monitor systems can be upgraded to surround by adding match- 
ing or similar speakers from the same product line. 

■ Can your consolé produce audio in surround 
sound?—Ifyour consolé has six or more output 
buses (8 bus +), your system can output 
surround in some manner; however, true sur- 
round panning and surround monitor control 
are often essential to surround production. 

■ Can your DAW produce audio in surround 
sound?—As above, certain DAWs are capable 
of routing audio to múltiple output buses (6 
bus +); however, most surround-capable work- 
stations are able to intégrate tracks, effeets and 
track exporting in surround with an amazing 
degree of versatility. 

■ How do I plan to monitor in surround?—If the 
consolé, interface or DAW offers true surround 
monitor capabilities, you're in luck (Figure 
17.2). If not, a hardware surround monitor 
control system or a surround preamp will be 
necessary. 

■ In what types of surround mastering tools 
should I invest?—Creating a surround-sound 
mix is only part of the battle. Often the real 
challenge comes in the mastering of the six 
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tracks into a final format that can be played on a commercial playback 
system ... when it comes to the intricacies of surround mastering of the 
various formats (dealt with later in this chapter), knowledge, attention to 
detail and patience definitely lead to power. 

Once these and other considerations have been taken into account, the task of 
choosing and installing a 5.1 surround system into the production control room 
can get under way. This can be a daunting task, requiring technical expertise 
and acoustical knowledge, or it can be a straightforward undertaking that 
requires only basic placement and system setup. 

5.1 Speaker placement 

As defined by the International Telecommunications Union (ITU), the "official" 
5.1 speaker setup is made up of five full-range monitors that are positioned in 
a circular are, with the speakers being placed at equal distances to the listener 
(at the center position). Three of the speakers are placed to the front, with the 
center speaker being placed dead center (0 o ) and the left/right speakers being 
placed at 30° ares to the center point. The surround speakers are then placed 
behind the listener at 110° ares to the center point (Figure 17.3). 



FIGURE 17.3 

The ITU “official” 5.1 
speaker setup. 
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It should be noted that the 110° are ¡n the surround field 
¡s a debatable point. During a special playback event, a 
rather noteworthy group of surround producers and 
engineers (including myself) compared a 110° are with a 


wider 130° are. We decided on the latter, with popular 
opinión being that the head and ears didn’t provide as 
much of a barrier to the wide-angle sound. You might 
want to perform a DIY and check it out for yourself. 


The full-range monitors are augmented in the bass end through the use of a 
low-pass low-frequency effeets (LFE) channel, which is then sent to the system's 
subwoofer. In film mixing, the low-pass crossover valué for Dolby's AC3 has 
been defined to be 120 Hz (although lower valúes, such as 80 Hz, are often 
chosen for use with the DTS system). 


Not all music producers and engineers agree that the LFE bass extensión channel ... definitely a subject for debate 

should be used for surround music production, because over a couple of pints o’ lager. 

it’s ¡ntended as a sub “effeets” channel and not a music 


The LFE's placement within the room isn't usually as critical, although placing 
it in a córner or in a position that's affected by a harmonic node in the room 
could greatly affect its response. If possible, the sub should be placed on the 
floor, near dead center (0 o ), at a reasonable distance from the front wall (to 
prevent excessive bass buildup). Finally, most active subs will offer full control 
over gain and crossover frequeney in order to best match the response to the 
room and main speaker set. 

From a level standpoint, all five of the surround speakers should be gain- 
adjusted to provide the same sound output level. This level matching can be 
done in many ways. For example, most surround preamps and certain surround 
monitor systems are able to output white or pink noise to the speakers at equal 
levels. These signáis are sequentially switched from speaker to speaker through- 
out the room. This allows the levels to be individually matched (either by ear 
or by using a level meter). 

Practical placement 

From a practical standpoint, I have often found surround to be far more forgiv- 
ing in placement than the above "spec" suggests. For example, it's often not 
practical to place the three front speakers in an equidistant are on most consoles 
or DAW desks because there simply isn't room. Usually, this means placing the 
speakers in a straight line (while angling the speakers for the best overall sound- 
field coverage). 
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FIGURE 17.4 

Mackie HR626 
monitor speakers. 
(Courtesy of Loud 
Technologies Inc., 
www.mackie.com.) 


Placing three matched speakers on the front bridge of a consolé, on floor/ceiling 
mounts or flush in a soffit generally isn't difficult. However, I found that match- 
ing the center speaker on a DAW desk is sometimes a challenge. This relates to 
the fact that the Computer monitor (or monitors) is commonly placed at the 
center position. If matched speakers are used, where can the center speaker be 
placed? You might be able to place the center speaker on its side. In addition, 
certain forward-thinking manufacturers make a dedicated, low-profile center 
speaker that allows the Computer monitor to be placed on top or under a shelf 
that can hold the monitor(s), as shown in Figure 17.4. 

In cases where space, budget and time are a consideration, it might not be pos- 
sible to exactly match the rear speakers to those in the front. In such cases, your 
best judgment should be used to match the general characteristics of the speaker 
sets. For example, if a company makes a speaker series that comes in various 
sizes, you might try putting a pair of their smaller monitors in the rear. If this 
isn't an option, intuition and ingenuity are your next best friends. 

ACTIVE/PASSIVE MONITORS IN SURROUND 

As you might recall from Chapter 16, active monitors inelude a powered 
amplifier(s) within their design, whereas passive monitors require that an exter- 
nal power amplifier be used to power their drivers. 

One of the results of the rise in popularity of powered speakers is the ability to 
upgrade a stereo system to a surround environment by simply adding three 
monitors to a suitably equipped production system. Often, adding extra passive 
speakers to a system adds the extra expense of having to install extra amps 
(for C/LS/RS), although consumer and professional five-channel surround 
amplifiers can be readily found on the market to round out the system. 



Surround Interfacing 


Regarding surround monitor control, it's a fortúnate fact that many of the newer 
high-end audio interfaces now inelude a master volume that can be programmed 
to control all of the channels in a surround-sound system ... a fortúnate fact, 
since dedicated hardware monitor Controls can cost in the thousands. For 
project studios on a budget that would also like the added benefit of having 
extensive control over surround monitoring, a logical choice might be to invest 
in a high-quality surround receiver system (which integrates preamp, monitor¬ 
ing and surround amp features into a single unit). Such a unit would offer 
monitoring features that are actually difficult to attain in most professional 
studios (i.e., mono, stereo, surround, ProLogic, múltiple input switching, 
record/play, dub switching, etc.). 

With regard to the LFE channel, almost all surround sub speakers are actively 
self-powered. Although many of these Systems provide active crossover outputs 
for diverting the low-frequency energy to the sub speakers, while sending the 
highs out to all five speakers (or more likely in a pro system, only the front 
stereo channels) ... extreme care should be taken when considering the use of 
this crossover network. The following should only be taken as a recommenda- 
tion when setting up your system. There are a number of powered/passive 
variables, and it's best that you fully research your options for obtaining the 
best sound: 

■ If the system is a bass managed "satellite" system (meaning that the speak¬ 
ers are not full range, but only contain mid-high drivers—while bass is 
directed to the sub), the crossover should be installed per the manufac¬ 
turéis instructions. 

■ If the surround speakers are full range, you might consider not routing 
the signáis through the active sub's crossover, because this might affect the 
natural low end of your monitors and could adversely affect the overall 
sound. 

■ You could experiment with routing the front stereo pair through the sub 
and see which way best suits your ears—as with much of audio, it's a very 
subjective subject that should be considered carefully. 

SURROUND INTERFACING 

Because surround requires that you have at least six output channels, it logically 
follows that your audio interface should be either: 

■ A dedicated audio interface with at least two inputs and six outs 

■ A multichannel audio interface (for example, having 8 ins x 8 outs). 

Over the years, a number of USB/FireWire interfaces have come onto the market 
that fully support surround sound. These devices allow audio to be easily and 
cost effectively routed from your host DAW or application to an external amp 
or active monitor system. In certain cases, these audio cards might inelude a 
driver that has extensive monitor Controls for varying setup levels, switching 
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between modes (mono, stereo, surround), inserting surround processing and 
generating setup tones—often well worth the investment. 

The various output buses of a DAW are designed to route to the first six 
outputs in a L/R/C/S/LS/RS configuration (although this scheme might change 
between DAW platforms and can be changed to fit the program and mastering 
requirements). 


SURROUND FINAL MASTERING AND 
DELIVERY FORMATS 

One of the underlying fallacies of surround-sound production is that you can 
take a consolé (or even a DAW for that matter) with at least six output buses, 
connect 5.1 speakers to their outputs, and then believe that you're ready to fully 
produce and distribute audio in surround. In fact, there are many hardware and 
software considerations that must be taken into account, including some that 
relate to the mastering of these six channels into a final format that can be 
played by both the average and the not-so-average consumer. 

This process often involves the conversión of a set of master recorded tracks or 
DAW session into a final format that can be distributed and reproduced using 
existing surround technology. Several of the production formats that are in 
current use inelude: 

■ Dolby Digital (AC3) 

■ DTS 

■ Dolby Pro Logic 

■ SRS 

■ WMA9 

■ MP4. 

Dolby Digital (AC3) 

Dolby Digital (a technical spec that's also known as AC3 ) is a popular codee 
that's used to encode digital audio into a multichannel (mono through 5.1) 
bitstream through the use of perceptual coding techniques. Because this code 
eliminates unused audio that is masked or imperceptible, the amount of data 
that's encoded within the bitstream can be greatly reduced (often by a factor of 
10), when compared to its uncompressed counterpart. 

Dolby Digital is used in the encoding of surround audio on the ever-present 
DVD, and has also been adopted for use in high-definition Blu-ray dises, HDTV 
televisión production, digital cable and satellite transmissions. The traditional 
flavor of Dolby Digital pro vides for five full-bandwidth channels in the L/R/C/ 
LFE/LS/RS format, while the LFE channel provides a low-pass channel (usually 
cutting off frequencies above 120 Hz) for special effeets and action sequences 
in movies (Figure 17.5). 
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FIGURE 17.5 

Dolby Digital Cinema 
System setup. 
(Courtesy of Dolby 
Laboratories, Inc., 
www.dolby.com.) 



left LFE center right 



Not all members of the home audience are able to listen to Dolby Digital in 
discrete 5.1, so a down-mixing feature was implemented into the codee to 
ensure compatibility with any mono or stereo playback device (meaning that 
additional tracks in other channel formats aren't necessary). In addition, the 
compressed nature of this format allows for various alternative track options 
(including soundtracks in various languages and bonus director/actor/back- 
ground tracks). 

The encoding of a six-channel audio master into a Dolby Digital encoded bit- 
stream can take place through the use of a Dolby licensed hardware encoder or 
through the use of an encoding software program or plug-in (Figure 17.6). 

Digital Theater System (DTS) is an audio encoding scheme that supports up to 
6.1 channels of discrete audio from a single datastream for use in cinema sound, 
DVD/home theater and multimedia. The DTS codee exists in several format 
flavors (depending on its intended application) and, as of this writing, these 
inelude: 

■ DTS Digital Surround: Pro vides 5.1 channels of discrete digital audio for 
commercial movie soundtracks, consumer electronics producís and soft¬ 
ware contení. 

■ DTS-ES: A digital audio format for delivering 6.1 channels of discrete 
audio in the consumer electronics market. DTS-ES is also fully backward- 
compatible with DTS decoders that aren't "Extended Surround" equipped. 

■ DTS Neo:6: Provides up to six channels of matrix decoding from stereo 
matrix material 

■ DTS 96/24: Offers 24-bit/96-kHz encoding for multichannel sound on 
DVD. 
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FIGURE 17.6 

Nuendo Dolby Digital 
Encoder plug-in. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.) 


■ DTS Interactive: Delivers real-time, discrete multi- 
channel interactive playback for PlayStation2 
games when connected to a DTS-equipped device 
(such as an AV receiver). 

■ DTS Virtual: Down-converts DTS 5.1- or 6.1- 
channel soundtracks to stereo while processing a 
realistic simulation of surround sound for two- 
channel equipment. 

DTS is used to encode audio data at rates of 1.5 Mbit/ 
sec or 754 kbit/sec (compared to the basic rate of 
448 kbit/sec, or the 384 kbit/sec rate of Dolby Digital). 
Although far fewer titles have been released in DTS than 
in its Dolby Digital counterpart, the new 754 kbit/sec 
rate has allowed a number of movie production studios 
to offer soundtracks in both Dolby Digital 5.1 and DTS 
5.1 formats. Unlike Dolby Digital (which optically 
encodes the soundtrack as data blocks between the 



sprocket holes of the film), DTS stores the encoded data 

on sepárate CD-ROM dises, which are then synchronized to the film via a 
SMPTE timecode track that's printed on the film. 

Dolby Pro Logic 

A common delivery format for getting surround sound to the masses is through 
the use of Dolby Pro Logic. In its simplest form, Pro Logic (Figure 17.7) is able 


FIGURE 17.7 

In its most basic 
form, Dolby Pro Logic 
is able to derive 
surround by using 
the simple phase- 
related mathematics 
of L = left, R = right, 
L + R = center and 
L - R = surround. 
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to derive a surround soundscape from a standard pair of stereo channels. This 
is done using a phase matrix system that operates on the basic principie that: 

Left channel = Left channel information only 

Right channel = Right channel information only 

Center channel = Left + Right channels (summed information with a 3-dB 
level drop to maintain a constant mix level) 

Surround channels = Left - Right channels. The difference information 
comes from the 90° phase shifts, which are derived from the 180° dif- 
ferential between the L and R signáis. In addition, the signal is band- 
limited from 100 Hz to 7 kHz and is then encoded with a modified form 
that is similar to Dolby B-type noise reduction. 

This simple system for deriving surround information from a standard stereo 
signal is what allows us to: 

■ Listen to a movie from a regular stereo VCR tape in surround 

■ Listen to our favorite TV show in surround 

■ Listen to the radio in surround or pseudo-surround 

■ Listen to your favorite stereo CD or record in surround or 
pseudo-surround. 

Yes, you read that last one right! When Pro Logic is switched in, it's possible to 
listen to a stereo program (of any type) in surround. However, there is one catch: 
If the stereo information is totally or largely in-phase, most of the sound will 
be summed (L + R) into the center speaker. If this happens, simply take Pro 
Logic out of Circuit and listen in stereo. If the stereo information is partially 
out-of-phase (L - R, and you'd be surprised how much of music is), the sound 
will open up into the surround field in a way that lets you hear oíd standards 
in a whole new way. 

On the production side, a surround project can be encoded into Type I or II 
(Type II is able to derive a L and R surround image) Pro Logic formats, using 
either a hardware encode/decode process or software encode plug-in (when 
used with a suitable surround DAW system). Encoding a surround mix into Pro 
Logic in the hardware domain can be a tricky venture. It usually requires that 
the hardware encoder and decoder be installed directly into the recording/ 
monitor signal path. This is often required because the results from a Pro Logic 
encode/decode process can be fraught with unintentional side effects. On the 
other hand, a number of surround-capable DAW Systems are capable of encod¬ 
ing a 5.1 mix into a form that's compatible with commercial phase-related 
surround decoders in the software domain. Often, these software plug-ins can 
export an accurate Pro Logic-compatible file without the need for an expensive 
hardware encoder and decoder ... often with more accurate results. 

As the surround or rear components of Pro Logic are an intentional (and often 
unintentional) by-product of phase shifts which are applied to create a 180° 
differential between the L & R signáis, this format can be prone to compatibil- 
ity issues when summed to mono. For this reason, it's often wise to make 
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periodic checks as to Pro Logic stereo and mono compatibilities during the 
course of a mix. Although the problems are often less than one might think, 
adjustments in balance and placement might need to be made within a music 
and/or visual soundtrack mix. At times, this calis for a compromise between the 
surround effect and mono compatibility. 


SRS 

The Circle Surround format from SRS technologies also makes use of matrix 
signal processing and phase steering technology to encode and decode surround 
sound. When listening to Circle Surround encoded material, a number of 
"spatial" enhancements can be taken advantage of from a stereo or surround 
setup: 

■ The SRS TruSurround XT process makes use of signal processing and phase 
steering technology in a way that allows us to perceive pseudo-surround 
effects from a stereo speaker pair. This technology (which has its basic 
foundation in 3D) lets us hear sounds that emanate from points beyond 
the physical width of the stereo speakers, as well as from behind the lis- 
tener's head. 

■ SRS Circle Surround II allows SRS or Dolby Pro Logic encoded DVDs to 
be reproduced from a surround-sound speaker system. It also allows stereo 
sources (such as CDs and MP3s) to be spatially enhanced, allowing stereo 
program material to be heard in a pseudo-surround. 

WMA 

Through the use of plug-in encoders from various DAW manufacturers, it's pos- 
sible to encode two-channel, as well as discrete 5.1 and 7.1 surround-sound 
audio into a bitstream that can be played back using Microsoft's Windows 
Media player (as well as others). The Windows Media Audio (WMA) codee, which 
was designed for digital download and multimedia formats, has been optimized 
for streaming or download-and-play delivery at bit rates ranging from 128 kbits/ 
sec to 768 kbits/sec, at depths of 16 or 24 bits using sample rates of 44.1, 48, 
88.2 or 96 kHz (Figure 17.8). 

When set up properly, true surround sound can be played from a standard 
Windows XP Computer, using an off-the-shelf surround sound card/speaker 
system or multichannel audio interface (Figure 17.9). It's also possible to play 
back surround-encoded files from a stereo Computer or media player through 
the use of a down-mix codee that proportionately mixes the center and rear 
channels into the L-R channels. 

MP4 

MPEG-4 (commonly known as MP4) is essentially based on Apple's Quicktime 
MOV format, and is capable of containing audio, video and subtitle datastreams 
in various bitstream formats including surround sound. This popular format 
allows for the distribution of audio and video contení over download and other 
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FIGURE 17.8 

Adobe Audition 
WMA9 surround 
encoder. (Courtesy of 
Adobe Systems Inc., 
www.adobe.com.) 
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data transmission media in a way that can be 
protected using an optional digital rights man- 
agement scheme. 

Initially, MPEG-4 was aimed at being a low-bit- 
rate video médium; however, its scope was later 
expanded to be efficient across a variety of bit 
rates ranging from a few kilobits per second to 
tens of megabits per second. 

Up -mix to 5.1 

One of the more recent developments in sur- 
round technology is the ability to make use of 
Processing and spatial technology to "up-mix" 
an existing stereo soundfile into a fully discrete 
surround set of files. With such a tool (Figure 
17.10), it's possible to take an existing stereo 
soundfile and (using various width, timbre, 
spaciousness and other parameters) create and 
control a wide range of variables to create a sur- 
round mix for music, televisión or film. 



V#*lO u 


MIXING IN SURROUND 

Any device having a number of múltiple bus consoles (typically having eight or 
more buses) can be used to create a surround-sound mix, but the important 
question of the day is this: How easily can signáis be routed, panned and 
affected in a surround environment to create a 5.1 mix without going nuts with 
frustration? 

Whether you're working in an analog hardware, digital hardware or DAW "in- 
the-box" mixing environment, the ability to pan mono or stereo sources into a 
surround soundscape, place effects in the 5.1 scape and monitor múltiple 
output formats without difficulty can make the difference between a difficult, 
compromised mix and one that lifts your spirits. 


FIGURE 17.10 

TC Electronic UnWrap 
PowerCore Plug-in 
for Stereo to 5.1 
Up- and Down- 
mixing. (Courtesy of 
TC Electronic A/S, 
www.tcelectronic. 
com.) 


Surround mixers 

As you might imagine, designing an analog consolé for use with full-scale sur- 
round features is almost always a massive and costly undertaking. Digital con¬ 
soles, on the other hand, are better suited to controlling the spatial and effects 
panning functions of numerous inputs through the use of a central control 
panel. Alternatively, a number of DAW Systems offer "in-the-box" software 
mixers that can mix, process and export in surround in a powerful and cost- 
effective environment (Figures 17.11). These systems are able to mix, automate, 
effect and then export a full-surround mix into a final format in a way that's 
often a thing of beauty—and in a way that's often hard to match in the hardware 
domain ... regardless of the price. 
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FIGURE 17.11 

Surround-sound 
panners within the 
Nuendo mixer. 
(Courtesy of 
Steinberg Media 
Technologies GmbH, 
a división of Yamaha 
Corporation, www. 
steinberg.net.)Mixing 
Philosophies 
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After having mixed many projects in surround myself and in talking with 
many top engineers and producers, Ive come to realize that the first rule 
of recording—"There are no rules, only guidelines"—definitely applies to 
surround-sound mixing. 

The ways in which sound can be placed in the soundscape are as varied as those 
who place them. For example: 

■ There are those who believe that the sound should be primarily placed 
in the front (L/C/R), while the reverb, natural ambient and/or audience 
should be placed in the rear. 

■ Others feel that there literally are no rules, and that sounds can be 
placed anywhere that sound good, interesting or just plain "right" for the 
material. 

■ Others will place monophonic sounds at pinpoint positions throughout 
the field, while others will place múltiple stereo fields throughout the 
soundscape (i.e., along any of the side or córner axes). 

■ Many will not use the LFE (sub) channel at all, believing that it adds 
an unnaturalness to the low bass that's better handled by the full-range 
speakers. 
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■ Others won't use the center speaker, believing that it messes with the 
center image along the L/R axis in a way that's reminiscent of the quad- 
raphonic era. 

It's my strong belief that how you choose to convey your music through the art 
of surround-sound mixing is up to you and the muses. It's a fun médium that 
should be experimented with. 

REISSUING BACK CATALOG MATERIAL 

On a final note, one of the unintended by-products of older, classic projects 
that are being reissued into surround sound by the record companies is the 
resurrection and the rescue of their older analog (and digital) masters that are 
aging (and/or are so badly documented) to the point that restoration becomes 
a monumental task. 

Regarding efforts to transfer these tracks to 24/96 digital archive soundfiles— 
it's not uncommon to hear horror stories of 16- and 24-track masters that have 
to be reconditioned and carefully baked, then played onto an analog machine 
... only for the iron oxide to shed off the tape and sepárate from its backing as 
it plays onto the floor during the actual transfer. 

Rescue stories like these are varied and awesome (in the truest form of the 
word). Sometimes the masters have already deteriorated past the point of play- 
ability and the safety backup must be used. Others must be tracked down in 
order to find that "right take" that wasn't properly documented. Suffice it to say 
that surround reissues are definitely doing their part toward helping to keep the 
original master tracks alive and kickin' into the 21st century, not to mention the 
fact that they are breathing new life into the tacks in the form of a killer, new 
surround-sound reissue. 

In a nutshell, listening to a well-crafted surround project over killer speakers in 
the studio, home playback or a home theater system can rank way up there with 
chocolate, motorcycle ridin' and sex—absolutely the best! 
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CHAPTER18 


Mastering 


The mastering process is an art form that uses specialized, high-quality audio 
gear in conjunction with one or more sets of critical ears to help the artist, 
producer and/or record label attain a particular sound and feel befo re the 
recording is made into a finished manufactured product. Working with such 
tools (Figures 18.1 and 18.2), a mastering engineer or experienced user can go 
about the task of shaping and arranging the various cuts of a project into a final 
form that can be replicated into a salable product. 

In past decades, when vinyl records ruled the airwaves and spun on everyone's 
sound system, the art of transferring high-quality sound from a master tape to 
a vinyl record was as much of a carefully guarded art form as it was technology. 
Because this field of expertise was (and still is) well beyond the abilities of most 
engineers and producers, the field of vinyl mastering was left to a very select few. 

The art of transferring sound to a CD is also still very much an art form that's 
often best left to those who are familiar with the tools and trade of getting the 
best sound out of a project. However, recent advances in Computer and effects 
Processing technology have also made it much easier for producers, engineers 
and musicians to own high-quality hardware and software tools that are fully 
capable of creating a professional-sounding final product in the studio or on a 
desk/laptop Computer. 

THE MASTERING PROCESS 

In addition to the concept of capturing the puré artistry of a production onto 
tape, hard disk or other médium, one of the primary goals during the course of 
recording a project is the overriding concept that the final product should have 
a certain "sound." This sound might be "clean," "punchy," "gutsy" or any other 
sonic adjective that you might be striving for. Of course, once all of the cuts 
have been mixed, you'll hopefully be able to sit back and say, "Yeah, that's it!" 
If this isn't the case, having an experienced mastering engineer help you to 
"shape" the project's sonic character through the careful use of level balancing, 
dynamics and EQ could help save your sonic Technicolor day. 

Another factor that can affect a project's sound is the reality that recordings may 
have been recorded and/or mixed in several studios, living rooms, bedrooms 
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FIGURE 18.1 

Lurssen Mastering, 
Hollywood, CA. 
(Courtesy of Lurssen 
Mastering, www. 
lurssenmastering. 
com.) 



FIGURE 18.2 

Emily Lazar, chief 
mastering engineer, 
The Lodge, New York 
City. (Courtesy of 
The Lodge, www. 
thelodge.com.) 



and/or basements over the course of several months or years. This could mean 
that the cuts would actually sound different from each other. In situations 
like this, where a unified, smooth sound might be hard to attain, it's even 
more important that someone who's experienced at the art of mastering be 
sought out. 
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In essence, the process of successfully mastering can help a project: 

■ Sound "right This is often accomplished through the use of careful EQ 
matching and dynamics processing. As was previously mentioned, this 
process not only takes the right set of processing gear, but also requires 
experienced ears that intuitively know how the project will most likely 
sound under a wide range of playing conditions. 

■ Be in the right sequential order: Choosing a project's song order is an art 
form that's best done by the artist and/or producer to convey the overall 
"feel" of a project. In addition to order, the intuitive process of setting 
the gap times between songs can also make the difference between having 
awkward pauses and a project that "flows" smoothly from one cut to 
the next. 

■ Playback at optimum levels: Traditionally, the industry as a whole tends to 
set the average level of a project at the highest possible valué. This is often 
due to the fact that record companies will always want their music to 
"stand out" above the rest when played on the TV, radio, MP3 player or 
the Web. This is usually accomplished by applying compression to the 
track or overall project. Again, this is an artistic technique that often 
requires experience. Overcompression can lead to audible artifacts or a 
sound that can "squash" the life out of your hard-earned sound. In fact, 
light compression or even no compression at all is also an alternative. 
Classical music lovers, for example, often spend big bucks to hear a pro¬ 
ject's full dynamic range. 

■ Match levels throughout the project: In addition to getting the best overall 
level, it's often important that levels be properly matched when transition- 
ing from one song to the next. The goal is to improve the flow and pro- 
fessionalism of a project by keeping songs from sticking out like a sore 
thumb. 

The equipment that deais with the art and technology of creating a finished 
"master" is available in many different guises. Often, top-level mastering engi- 
neers will use specially designed EQ, dynamics and level matching gear that 
often won't commonly be found in the recording studio environment. Having 
said this, with the advent of CD and DVD burning software, dedicated hardware 
and software processing systems are now on the market that give musicians, 
producers and engineers a greater degree of control over the final mix and/or 
finished master than ever before. 

To master or not to master—was that the question? 

As mentioned, the process of mastering often requires specialized technical 
skills, audiophile equipment, a carefully tuned listening environment and 
talented ears in order to pulí a sonic rabbit out of a problematic hat or even 
one that could simply use some dressing up. 

When approaching the question of whether to have a project professionally mas- 
tered or not, it's important that you objectively consider the following questions: 
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■ Is the final mix adequate (or more than adequate) for its intended 
purpose? 

■ If not, would the project benefit most from an outside set of professional 
ears or could it be done in house? 

■ Does the budget allow for the project to be professionally mastered? 

If the decisions favor that the Services of a professional mastering house and 
engineer be sought, the next question to ask is "Who'll do the mastering?" In 
this instance, it's important that you take a long hard look at the experience 
level of the person who will be doing the job and make it your job to familiar- 
ize yourself with that person's work and personal style. In fact, it's probably best 
to borrow from the traditional business practice of finding three of the most 
appropriate mastering house/engineer facilities to bid for the job and follow 
due diligence in making your decisión by considering the following: 

■ What is their mastering track record? 

■ Are you familiar with examples of their work? If not, you should definitely 
ask for a client list and recent examples of their work ... then, have a 
critical listening session with the producer and/or band members. 

■ Are they familiar with your music genre as well as the type of "sound" 
that you're hoping to achieve? 

■ What are their hourly or project rates? Do they fit your budget? 

■ Are they willing to do a complementary test mastering session on one of 
your cuts? 

Bottom Une: Beware of the inexperienced mastering engineer (especially if that 
person is you). Once the choice of an experienced professional has been 
made, it's often wise for you/your band and the producer to personally sit in 
on the mastering session. Make sure that there are several ears around to 
listen to the project and listen over several types of systems. And above all, be 
patient, be critical of the project's sound and listen to the opinions of others. 
Sometimes you get lucky and the mastering process can be quick and painless; 
at other times it takes the right gear, keen ears and lots of careful attention to 
detail. 

“Pre”paration 

Just as one of the best ways to make the mixdown process go more smoothly 
is to fix most of your technical problems and issues BEFORE you start the 
mixdown process, one of the best ways to ensure that a mastering session has 
as few problems as possible is to ask the right questions and deal with the 
technical issues BEFORE the mastering engineer even receives your soundfiles. 
By far, the best way to avoid problems during this phase is to ask questions 
ahead of time. The mastering engineer should be willing to sit down with you 
or your team to discuss your needs and preplanning issues (or at least direct 
you to a document checklist that can help you through the preparation process). 
During this introductory "getting to know you and your technical requirements" 
session, here are just a few questions that you might ask: 
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■ What should the final master sample rate and bit rate be? 

■ What should the master's máximum level be? 

■ Should all master compression be turned off? Would you like for us to 
supply you a copy with the bus compression turned on, so you can hear 
our intended ideas? 

■ Would you like sepárate instrument/vocal stem tracks, so they can be 
treated separately? 

■ Are there any special requirements that we should be aware of? 


Important Notes to Remember 


■ It’s ALWAYS wise to consult with the project’s 
mastering engineer about the general specifications 
of the final product BEFORE beginning the mixdown 
(or possibly even the recording) process. For example, 
that person might prefer that the files be recorded 
and/or mixed at a certain bit rate and bit depth, as well 
as in a specific format. 


■ A mastering engineer might prefer that the final 
master be changed or processed as little as 
possible with regard to normalization, fade changes, 
overall dynamic changes (compression) and applied 
dither. These Processing functions are best done 
in the final mastering process by a qualified 
engineer. 


Mastering the details of a project 

From the mastering standpoint (as well as those of the artists and producer who 
are overseeing the project), a wide range of artistic decisions need to be carefully 
finessed in order to master a recording into its final, approved form. Just a few 
of the decision-making steps inelude: 

■ Choosing the proper song order 

■ The use of level changes to balance the relative track levels and improve 
the overall "feel" of the project 

■ The application of EQ to improve the sound and overall "tone" of the 
project 

■ The judicial use of dynamics to balance out the sound and increase the 
project's overall level. 

Sequencing: the natural order of things 

Whether the master is to be assembled using analog tape, or on a DAW/editor, 
the running order in which the songs of a project will be played often affeets 
the overall flow and tone of a project. The considerations for which song follows 
which is infinitely varied, and can only be garnered from experience and having 
an artistic "feel" for how their order and interactions will affect the listening 
experience. Of course, sequence decisions are probably best made by the artist 
and/or producer, as they have the best feel for the project. A number of variables 
that'll directly affect the sequenced order of a project inelude: 
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■ Total length: How many songs will be included on the disc or álbum? If 
you've recorded extra songs, could we inelude the Bonus Tracks on the 
disc? Is it worth adding a weaker song, just to fill up the CD? 

■ Running order: Which song should start? Which should cióse? What order 
feels best and supports the overall mood and intention of the project? 

■ Transitions: Altering the transition times between songs can actually make 
the difference between an awkward silence that jostles the mood and a 
transition that keeps up with the pace and feel of the project. The Red 
Book CD standard calis for 2 seconds of silence as a default setting 
between tracks. Although this is necessary befo re the beginning of the first 
track, it isn't at all the law for spacings that fall between the later songs. 
Most editors will allow you to alter the Índex spacings between tracks from 
00 seconds (butt splice) to longer gaps that help maintain the appropriate 
transitional mood. 

■ Cross-fades: In certain situations, the transition from one song to the next 
is best served by cross-fading from one track directly to the next. Such a 
fade could seamlessly intertwine the two pieces, providing the glue that 
can help convey any number of emotional ties. 

A good friend of mine, Craig Anderton, actually mapped out the up and down 
flow of a musical project over its course by using a spreadsheet. Such a docu- 
ment might indícate weak areas that could use a bit of special attention, as well 
as provide documentation of cross-fade times, Índex times, gap lengths, cross- 
fade in/outs and other information that would be extremely helpful to the 
mastering engineer. 

Relative volumes 

In addition to addressing the overall volume levels of a project, one of the tasks 
of the mastering process is to smooth out the relative volume differences 
between songs over the course of the disc or álbum. These differences could 
occur from a number of sources, including general variations in mixdown and 
program content levels, as well as levels between projeets that have been mixed 
at different studios. 

Cues as to smoothing out the relative rms and peak differences can be 
obtained by: 

■ Using your ears to fine-tune the volume levels from song to song 

■ Looking at the general attributes of a soundfile from within a digital audio 
editor 

■ Carefully watching the master output meters on a recorder or editor 

■ Watching the graphic levels of the songs as they line up in a digital audio 
editor. 

Contrary to popular belief, the use of a standard DAW normalization tool can't 
smooth out these level differences, because this process only deteets the peak 
level within a soundfile and raises the overall level to a determined valué. Since 
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the average (rms) and peak levels will often vary widely between the songs of 
a project, this toll isn't always useful, although certain editors provide normal- 
ization tools that have more variables that are more useful and in depth. 

EQ 

As is the case in the mixdown process, equalization is often an extremely impor- 
tant tool for boosting, cutting or tightening up the low end, adding presence to 
the midrange and tailoring the high end of a song or overall project. Again, EQ 
can be used as a tool to smooth out differences between cuts or for making 
changes that affect the overall character of the entire project. Of course, wide 
ranges of hardware and software plug-in EQ systems are available for applying 
the final touches both within a studio and project-based setting (Figure 18.3). 

Dynamics 

One of the most commonly used (and overused) tools within the mastering 
process relates to dynamics processing, or specifically, compression (Figures 18.4 
and 18.5). 

Although the general name-of-the-game is to achieve the highest overall average 
level within a song or complete project, care must be taken so as not to apply 
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FIGURE 18.3 

EMI TG12412 
mastering outboard 
EQ plug-in. (Courtesy 
of Digidesign, a 
división of Avid, 
www.digidesign. 
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FIGURE 18.4 

Manley Stereo 
Variable Mu® 
mastering outboard 
compressor. 
(Courtesy of Manley 
Laboratories, Inc., 
www.manleylabs. 
com.) 



FIGURE 18.5 

McDSP ML4000HD 
Mastering Plug-ins 
for Pro Tools HD, LE 
and M-Powered. 
(Courtesy of 
McDowell Signal 
Processing, LLC, 
www.mcdsp.com.) 



so much compression that the life gets dynamically sucked out of the sound 
(Figures 18.6). As with the first rule in recording—"There are no rules"—the 
amount of dynamics processing is entirely up to those who are creatively 
involved in the final mastering process. However, it's important to keep in mind 
the following guidelines: 

■ Depending on the program contení and genre, the general dynamic trend 
is toward raising the overall levels to as high a point as possible. 

■ When pushed to an extreme, compression will often have an intended or 
unintended side effect of creating a sound that has been "squashed," 
giving a "wall of sound" character that's thick (a good thing) and/or one 
that's sonically lifeless (a bad thing). 

■ When compression is not applied (or little is used), the sound levels will 
often be lower, thinner (that might be bad) or full of dynamic life (that's 
good). 
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dynamic passage 



compressed passage 


FIGURE 18.6 

Figure showing a 
dynamic passage 
and the same 
compressed passage. 



FIGURE 18.7 

The te electronic 
Master X3 multiband 
dynamic plug-in 
processor. (Courtesy 
of te electronic,www. 
tcelectronic.com.) 


From all of this, you'd be correct if you said that the process is entirely subjec- 
tive and relative! The use of compression can help add a strong presence to a 
recording, while overuse can actually kill its dynamic life ... so use it wisely! 

Multiband dynamic processing 

The modern-day mastering process often makes use of multiband dynamic 
Processing (Figure 18.7) in order to break the frequencies of the audio spectrum 
into bands that can be individually processed. Depending on the system, for 
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example, up to five distinct bands might be available for processing the final 
signal. Such a hardware or software system could be used to strongly compress 
the low frequencies of a song using a specific set of parameters, while applying 
only a small amount of compression to the sibilance at its upper end. 

File resolution 

The sample rate and bit rate resolution that a project is recorded at is a personal 
matter (as with many things in sound recording). Given that high-quality con- 
verters and a DAW with a high-quality bit-processing structure are used, many 
believe that recordings made with 16-bit/44.1-kHz file resolutions are sufficient 
to capture the full nuances of sound (for both the recorded tracks and the final 
mixdown). Others believe that rates upward to 24 bits/96 kHz are necessary to 
capture the extended high frequencies and increased resolution of music. 

If higher bit rates are chosen, it's generally best to deliver the final master record¬ 
ing to the mastering lab at the original native rate (i.e., if the session was 
recorded and mixed at 24/96, the final mixdown resolution should delivered 
to the mastering engineer at that rate). The reasoning behind this is ... in almost 
every circumstance, the mastering engineer will be able to process the mix at 
the higher bit resolution and will have better tools by which to dither the 
soundfiles down to the necessary 16/44.1 rate for final transfer to CD. 

Dither 

As was stated in Chapter 6, the addition of small amounts of randomly gener- 
ated noise to an existing bitstream can actually increase the overall bit resolution 
(and therefore low-level noise and signal clarity) of a recorded signal. Through 
the careful addition of dither, it's actually possible for signáis to be encoded at 
levels that are less than the data's least significant bit level. You heard that right 
... by adding a small amount of random noise into the A/D path, the resolution 
of the conversión process can actually be improved below the least significant 
bit level and reduce a soundfile's harmonic distortion. 

Within mastering, dither is often manually applied to soundfiles that have been 
recorded at 20- and 24-bit depths. DAW plug-ins can be used to apply dither 
to a soundfile or master mix, so as to reduce the effects of lost resolution due 
to the truncation of least significant bits. For example, mastering engineers 
might carefully experiment with applying dither to a high-resolution file before 
saving or exporting it as a 16-bit final master. In this way, noise is reduced and 
the soundfile's overall clarity is increased. 


THE DIGITAL AUDIO EDITOR IN 
THE MASTERING PROCESS 

By far, the most commonly used system for modern-day mastering is the digital 
audio editor (Figure 18.8). These two-channel and multichannel workstations 
make use of the personal computer's existing processing, disk storage and data 
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1/0 hardware to perform a wide range of audio 
editing, processing and mastering production 
tasks. These programs allow each song to be 
loaded onto its own track and be independently 
processed according to its own needs, while pos- 
sibly adding overall effects processing to the 
master output section (allowing EQ, dynamics, 
dither, etc., to be applied to the entire project 
mix). In such a graphic, on-screen environment, 
individual songs can be imported, processed 
and exported into a final form that can then be 
burned to disc—either directly from within the 
editing program or by using a CD burning soft¬ 
ware application. 

ON A FINAL NOTE 

In closing, it's important to remember, whether 
you have decided to have your latest project mas- 
tered at a professional facility or in-house, always 
ask for a proof copy(s) of the final master record- 
ing. If at all possible, take a week and listen to it 
in your car, on your boom box, home theater 
system, in another studio—virtually everywhere! 

As a musician, producer or record label, it will 

be your calling card for quite some time to come. Once you're satisfied with the 
sound of the finished product, then you can move from the mastering phase 
into making the project into a finished, salable product. 



FIGURE 18.8 

Sadie PCM2 Editing 
System. (Courtesy of 
Prism Sound Limited, 
www.sadie.com.) 
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Product Manufacture 


One of greatest misconceptions surrounding the music, visual and other media- 
related industries is the idea that once you walk out the door of a studio with 
your final master in hand, the Creative process of producing a project is finally 
over. All that you have left to do is hand the file, CD or other médium over to 
a duplication facility (or put the music on a download site) and—Ta-Dah!—the 
buying public will be clamoring for your product, website and merchandise. 
Obviously, this scenario is almost always far from 
the truth. Even before you have your blood, 
sweat and tears physically in hand, it's 
vitally important to think through and 
implement your master plan, if your 
product is to make it into the hands 
(or iPod) of the consumer. 

Early in this book, I told you about 
the first rule of recording ... that "there 
are no rules, only guidelines." This actu- 
ally isn't true. There is one rule: 

Once the recording and mix phases of a project have 

been completed (assuming that you're also doing your business and promotion 
homework as to your audience, distribution methods, live and Web marketing 
presence, production budgeting, etc.), the next step toward getting the product 
out to the people is to transform the completed song or project into a form that 
can be mass produced, distributed, marketed and SOLD. Given the various 
technologies that are available today, this could take the form of a compact disc, 
DVD, vinyl record or encoded set of MP3/FLAC files that can be distributed and 
sold on the Web. Each of these media types has its own set of manufacturing 
and distribution needs that require a great deal of attention throughout each 
step of the manufacturing and/or creation process. 

CHOOSING THE RIGHT FACILITY 
AND MANUFACTURER 

Just as each recording facility has its own unique personality and particular 
"sound," the right mastering and duplication facilities may also have a profound 


If you don’t preplan, and follouj 
through with these plans once the project 
is recorded, you can be fairly sure that your 
project ujill sit on a shelf or, ujorse, you’ll have 
a 1000 CDs sitting in your basement that’ll 
never be heard—a huge shame given the 
hard inork that ujent into making it. 



Choosing the Right Facilitg and Manufacturer 


effect on the outcome of a project. If a project is being underwritten and dis- 
tributed by an independent or major record label, they will generally be fully 
aware of their production needs and will certainly have an established produc- 
tion and manufacturing network in place. If, however, you're distributing the 
project yourself, the duty of choosing the best facility or manufacturing organi- 
zation that'll fit your budget and quality needs will fall on your or your band's 
shoulders. 

Manufacturing facilities come in two types: those that perform and offer all of 
their Services "in-house" (on the premises) and those that "outsource" (contract 
with other business or individuáis to perform various printing and pressing 
Services). Neither of these types is either all good or all bad. On one hand, in- 
house facilities are able to handle all of the phases of producing a finished 
product, from beginning to end; these facilities are often large and expensive to 
equip (meaning that the production plant might not be located nearby). On 
the other hand, manufacturers and duplicators that farm out projects may not 
have total control over their production timeline, but are often able to offer 
personalized, one-on-one Service. 

Of course, product manufacturers are located in various parts of the country 
and the world. Here are a few guidelines for finding the best one for your 
project: 

■ One of the best resources is the old-fashioned word-of-mouth. Ask people 
you know and trust about who they know and trust to manufacture their 
project. 

■ Look through the back-page ads in most music- and audio-related 
magazines. 

■ Search the Internet to compare prices and Services. 

As with any part of the production process, it's always wise to do a full back- 
ground check on a production facility and even compare prices and Services 
from at least three manufacturing houses. Give the company a cali, ask a few 
questions and try to get a "feel" for their customer Service abilities, their willing- 
ness to help with layout questions, etc. You'd be surprised about how much 
you can learn in a short time. Once you've found a few companies that seem 
to fit your needs, ask them for a promotional pack (which ineludes product and 

art samples, Service options and a price sheet). You might also 
want to ask about former customers and their contact 
information (so you can e-mail them about their 
experiences). 

Once you've settled on a manufacturer, it's 
always a good idea to research what their 
product and graphic arts needs and specs are 
before delving into this production phase. When 
in doubt about anything, give them a cali and ask; 
they are there to help you get the best possible product 


Whenever possible, it’s always 
wise and extremely important that 
you be given art proofs and test 
pressings BEFORE the final 
produets are mass duplicated. 
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(besides, asking questions helps to get the job done right the first time, with 
less muss, fuss, time and $$$). 

The absolute last thing that you or the artist wants is to have several thousand 
producís arrive on your doorstep that are ... WRONG! Receiving a test pressing 
and graphic "proof" is almost always well worth the time and money. It's never 
wise to assume that a manufacturing or duplication process is perfect and 
doesn't make mistakes. Remember, Murphy's Law can pop up at any time! 

CD MANUFACTURING 

Beyond the process of distributing audio over the Internet (using an online 
Service or from your own site), as of this writing, the compact disc (CD) is still 
a strong and viable médium for distributing music. These 120-mm silvery dises 
(Figure 19.1) contain digitally encoded information (in the form of microscopic 
pits) that's capable of yielding playing times of up to 74 or 80 minutes at a 
standard sampling rate of 44.1 kHz. 

The pit of a CD is approximately half a micrometer wide, and a standard 
manufactured disc can hold about 2 billion pits. These pits are encoded onto 
the disc's surface in a spiraling fashion, similar to that of a record, except that 
60 CD spirals can fit into the groove of a single long-playing record. These 
spirals also differ from a record in that they travel outward from the center of 
the disc, are impressed into the plástic substrate, and are then covered with a 
thin coating of aluminum (or occasionally gold) so that the láser light can be 
reflected back to a receiver. When the disc is placed in a CD player, a low-level 
infrared láser is alternately reflected and not reflected back to a photosensitive 
pickup. In this way, the reflected data is modulated so that each pit edge repre- 
sents a binary 1, and the absence of a pit edge represents a binary 0 (Figure 
19.2). Lipón playback, the data is then demodulated and converted back into 
an analog form. 



FIGURE 19.1 

The compact disc. 
(Courtesy of 
www.51 bpm.com 
and www. 

nationwidedisc.com.) 
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FIGURE 19.2 

Transitions between 
a pit edge (binary 1) 
and the absence of a 
pit edge (binary 0). 
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Songs or other types of audio material can be grouped on a CD into tracks 
known as "indexes" This is done via a subcode channel lookup table, which 
makes it possible for the player to identify and quickly lócate tracks with frame 
accuracy. 

Subcodes are event pointers that tell the player how many selections are on the 
disc and where their beginning address points are located. At present, eight 
subcode channels are available on the CD format, although only two (the P 
and Q subcodes) are used. 

Functionally, the CD encoding system splits the 16 bits of information into two 
8-bit words with error correction (that's applied in order to correct for lost or 
erroneous signáis). In fact, without error correction, the CD playback process 
would be so fragüe and prone to dropouts that it's doubtful it would've become 
a viable médium. The system then goes about translating this data (using a 
process known as eight-to-fourteen modulation or EFM) into a special code, 
known as a data frame. Each data frame contains a frame-synchronization 
pattern (27 bits) that tells the láser pickup beam where it is on the disc. This is 
then followed by a 17-bit subcode word, 12 words of audio data (17 bits each), 
8 parity words (17 bits each), 12 more words of audio, and a final 8 words of 
parity data. 

The process 

In order to transíate the raw PCM of a music or audio project into a format that 
can be understood by a CD player, a compact disc burning system must be used. 
These come in two flavors: 

■ Specialized hardware/software that's used by professional mastering 
facilities 

■ Disc burning hardware/software Systems that allow a personal Computer 
to easily and cost effectively burn CDs. 

Both system types allow audio to be entered into the system, after which the 
tracks can be assembled into the proper order and the appropriate gap times 
can be entered between tracks (in the form of Índex timings). Depending on 
the system, cuts might also be processed using cross-fades, volume, EQ and 
other parameters. Once assembled, the project can be "finalized" into a media 
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form that can be directly accepted by a CD manufacturing facility. By far, the 
most common media that's received by CD pressing plants for making the final 
master disc are user-burned CD-Recordable (CD-R) dises, although some pro- 
fessional systems will still make use of a special Exabyte-type data tape system. 


Note that not all CD-R media are manufactured using 
high-quality standards. ... In fact, some are so low in 
quality, that the project’s data integrity could be jeopar- 
dized. As a general rule: 

■ It’s always good to use high-quality “master-grade” 
CD-Rs to bum the final master (you can sometimes 
see the difference in pit quality with the naked eye). 


■ It’s always best to send two ¡dentical copies to the 
manufacturer (just in case one fails). 

■ Speaking of failing, you should check that the 
manufacturer has run a data integrity check on 
the final master to ensure that there are few to 
no errors. 


Once the manufacturing plant has received the recorded media, the next stage 
in the process is to cut the original CD master disc. The heart of such a CD 
cutting system is an optical transport assembly that contains all the optics nec- 
essary to write the digital data onto a reusable glass master disc that has been 
prepared with a photosensitive material. 

After the glass master has been exposed using a special recording láser, it's placed 
in a developing machine that etches away the exposed areas to create a finished 
master. An alternative process, known as nonphotoresist, etches directly into 
the photosensitive substrate of the glass master without the need for a develop- 
ment process. 

After the glass or CD master disc has been cut, the compact disc manufacturing 
process can begin (Figure 19.3). LJnder extreme clean-room conditions, the 
glass disc is electroplated with a thin layer of electroconductive metal. From 
this, the negative metal master is used to create a "metal mother," which is used 
to replicate a number of metal "stampers" (metal plates which contain a nega¬ 
tive image of the CD's data surface). The resulting stampers make it possible 
for machines to replicate clear plástic dises that contain the positive encoded 
pits, which are then coated with a thin layer of foil (for increased reflectivity) 
and encased in clear resin for stability and protection. Once this is done, all 
that remains is the screen-printing process and final packaging. The rest is in 
the hands of the record company, the distributors, marketing and you. 

CD burning software 

Software for burning CDs/DVDs/Blu-ray media is available in various forms 
for both the Mac and PC. These inelude the simple burning applications that 
are included with the popular operating systems, popular third-party burning 
applications and more complex authoring programs that are capable of 
editing, mastering and assembling individual cuts into a final burned master 
(Figure 19.4). 
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FIGURE 19.3 

Various phases of the 
CD manufacturing 
process: (a) the 
lab, where the CD 
mastering process 
begins; (b) once 
the graphics are 
approved, the 
project’s packaging 
can move onto the 
printing phase; 

(c) while the 
packaging is being 
printed, the approved 
master can be 
burned onto a glass 
master disc; (d) next, 
the master stamper 
(or stampers) is 
placed onto the 
production line for 
CD pressing; (e) the 
freshly stamped 
dises are cooled and 
checked for data 
integrity; (f) labels 
are then silk-screen 
printed onto the CDs; 
and (g) finally, the 
printed CDs are 
checked before being 
inserted into their 
finished packaging. 
(Courtesy of Disc 
Makers, Inc., www. 
discmakers.com.) 




There are numerous ways in which a CD project can be prepared and burned. 
For starters, it's a fairly simple matter to prepare and master individual songs 
within a project and then load them into a program for burning. Using this 
method, any program should be able to burn the audiofiles in a straightforward 
manner from beginning to end. Keep in mind that the Red Book CD standard 
specifies a beginning header silence (pause length) that's 2 seconds long. After 
this initial lead-in, any pause length can be specified; the default setting for 
silence between cuts is 2 seconds, however, any musician/producer will tell you 
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FIGURE 19.3 

Continued 


that these lengths will vary, as one song might want to flow directly into another, 
while the next might want a longer pause to help set the proper mood (it's an 
artistic "feel" kinda thing). Given these options, you could either: 

■ Manually enter the pause settings between each song into the burning 
program 

■ Manually paste silence at the end of the soundfile itself, and the enter a 
pause for all tracks, except for track 1, which has a pause length of 0 
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FIGURE 19.3 

Continued 



■ Assemble the entire mastered project into a single session (with cross- 
fades, if needed), export each cut as a sepárate track and then enter these 
soundfiles into a burning program with no pauses between the tracks. 

The advantage of creating final soundfiles that inelude silence at the end of each 
track is that they can be burned by any standard program. A CD mastering 
program, on the other hand, allows files to be assembled, edited, mastered as 
to volume, dynamics, EQ, etc. and then burned to disc with a great degree of 
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FIGURE 19.3 

Continued 





FIGURE 19.4 

DiscWelder Steel 
CD/DVD audio 
authoring tool for 
the PC. (Courtesy of 
Minnetonka Audio 
Software, Inc., www. 
minnetonkasoftware. 
com.) 


post-production flexibility. As always, careful attention to the details should 
always be made, making sure that: 

■ High-quality media is used 

■ The media is burned using a stable, high-quality drive 

■ The media is carefully labeled using a recommended marking pen 

■ Two copies are delivered to the manufacturer, just in case of data problems 
on one of the dises. 
























CD-R/RW Media 


Many CD burning programs will allow you to enter "CD Text" information 
(such as title, artist ñame/copyright and track ñame field code info) that is then 
written directly into the CD's subcode area. This can be a helpful tool, because 
important artist, copyright and track identifiers can be directly embedded within 
the CD itself and will be automatically displayed on most modern CD players. 
As a result, illegal copies will still contain the proper copyright and artist info, 
and dises that are loaded into a Computer or media player will often display 
these fields. While you're at it, you should check with the manufacturer to see 
if they're able to enter your track and project information into iTunes and CDDB 
(Gracenote CD database or FreeDB). These databases (which should not be 
confused with CD Text) allow CD titles, artist info, song titles and graphics to 
appear on most media-based music players ... an important feature for any 
artist and label. 

CD-R/RW MEDIA 

It's a given that the process of burning a CD-R (Compact Disc-Recordable) and 
the CD-RW (Compact Disc-ReWritable) on a Mac or PC has become an effective 
tool for getting music out to the general public. On the positive side, CD-R/ 
RWs give us a simple and cost-effective media for burning audio CDs for small- 
run distribution, handing music out to our friends, project reference dises, 
backing up and copying data, etc. ... On the other side of the tracks (depending 
on your business viewpoint), the CD-R has allowed for the easy copying of 
copyrighted music and other intellectual properties in a way that continúes to 
change the industry's business landscape. 

From a basic technical standpoint, the CD-R uses various photoreactive dye 
formulations that allow "virtual pits" to be burned into the surface in a way 
that creates a reflective/nonreflective datastream that adheres to the various CD 
specs (i.e., Red Book, Orange Book). This data can then be written to disc using 
a number of available writing options: 

■ Disc-at-once: This mode continuously writes the data onto a CD without 
any interruptions. All of the information is transferred from hard disk to 
the CD in a single pass, with the lead-in, program and lead-out areas being 
written to disc as an uninterrupted event. 

■ Track-at-once: This allows a session to be written as a number of discrete 
events (called tracks). With the help of special software, the disc can be 
read befo re the final session is fixated (a process that "closes" the disc into 
a final form that can be read by any CD or CD-ROM drive). 

■ Multisession: Dises written in this mode allow several sessions to be 
recorded onto a disc (each containing its own lead-in, program data and 
lead-out areas), thereby allowing data to be recorded onto the free space 
of a previously recorded CD. It should be noted that older drives might 
not be able to read this mode and will only read the first available session. 

While the altering of the data pits on a CD-R is permanent, a CD-RW can be 
erased and rewritten any number of times (often figured to be in the thousands). 
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When using a specially designed CD-RW drive (which can also burn standard 
CD-Rs), this médium type is excellent for creating data backups and media 
archiving. 

The overall quality and data longevity of CD-R/RW media is often determined 
by a number of factors, including: 

■ Media: There can be huge variations in quality between CD-R/RW media. 
Substandard dye concentrations and formulations, the use of a thin coat 
backing (or no backing at all) on the printed disc side can make the media 
more susceptible to scratches, etc. 

■ Drive: A poorly designed drive or one whose láser has gone out of calibra- 
tion with age can also cause data to be improperly written or contain 
excessive errors. 

■ Storage: Two of the CD's greatest enemies are excessive heat and direct 
sunlight. As a result, proper storage is important for keeping your archived 
data intact. 


The Web contains various sites that are devoted to 
information about these and other optical and backup 
media. They are definitely worth checking out! 

■ OSTA (www.osta.org) is an ¡nternational trade associa- 
tion that’s dedicated to promoting use of writable 


optical technology for storing Computer data and 
images. 

■ Searching Wikipedia (www.wikipedia.org) under CD-R 
or CD-RW will also provide a lot of useful facts. 


Although most manufacturing plants receive master CDs that have been burned 
onto a CD-R, it's interesting to note that many of these dises don't pass the basic 
requirements that have been set forth for creating an acceptable Red Book-Audio 
CD (the standard industry specification). Some of the problems associated with 
CD-Rs that have been burned on a desktop system inelude: 

■ Excessive data errors: This can lead to mass-produced CDs that have prob¬ 
lems when being played on older or less reliable CD players. These errors 
often crop up due to such factors as hardware/software reliability prob¬ 
lems or lack of media integrity. 

■ Dises that haven't been "closed": It's very important that the master disc be 
closed (a coding process that ensures that no other sessions or data can 
be added to the disc). Most CD mastering software packages will give you 
the option of closing or "finalizing" the disc upon burning. 

■ Multisession dises: Final master dises should never contain múltiple ses¬ 
sions (whereby music cuts or program material can be added at a later 
time to an existing CD-R). The disc should be recorded and finalized in 
the "disk-at-once" mode (meaning the disc was burned from beginning 
to end, without any interruptions in the láser burning process, and then 
is finalized). 


Rolling Your Ouin 


■ Inaccurate Índex marker points: Index markers tell the CD player where the 
tracks begin and end on a disc. If the markers are wrong, the program 
could begin early or cut off parís of a song. Interestingly, this will usually 
only happen on a more expensive, dedicated system where the Índex has 
to be manually entered). Once a disc has been cut, always listen to a disc 
to check for accurate índex markers. 

In fací, once you've checked the beginning and end marker points, it's always 
wise to critically listen to the disc from beginning to end. Never forget that 
Murphy's Law lurks around every córner! 

Burning speeds 

Whenever you see specs on a CD-R or CD-RW burner that look like "32 x 10 x 
40," the numbers indícate the various read and writing speeds of the CD drive. 
The "x" stands for the device's data transfer speed as múltiples of 150 kbits/sec. 
The first number (32 in the above example) indicates the speed that the drive 
is capable of writing data onto a CD-R disc. In the above example, the drive can 
write at transfer speeds of up to 32 x 150 kbits/sec = 4800 kbits/sec. The second 
number represents the speed that the drive can rewrite data onto a CD-RW disc 
(i.e., 10 x 150 kbits/sec = 1500 kbits/sec). The final number indicates the top 
speed at which the drive can read (i.e., 40 x 150 kbits/sec = 6000 kbits/sec). 

All of the data that's written to a CD uses Cross-Interleaved Reed-Solomon Code 
(CIRC) for error correction. CIRC is capable of applying two levels of correction: 
C1 and C2. C1 applies to bit errors that can be easily corrected by the system 
during the read process, as the data is interleaved and spread over the disc 
surface in a large are. (This is why CDs should always be cleaned in a straight 
line from the center out because a circular scratch could cause múltiple errors 
across several data frames that can't be corrected.) The larger, C2 errors apply 
to bytes in a frame that can result in uncorrectable errors or ones that can't be 
corrected using additional coding. 

Finding the optimum CD-R burning speed for your Computer or replicator is a 
topic that's best left to debate between buddies over a pint of lager. There are 
those who passionately feel that burning at lower speeds will improve the 
burning process due to improved disc stability and optimum láser performance, 
while others will argüe that performance improves at higher speeds using newer 
media dyes and improved láser assemblies. I will bow out of this debate by 
challenging you to research the data, the articles and the many message postings 
that have been dedicated to this subject. A good site for beginning your research 
is www.cdrfaq.org. 


ROLLING YOUR OWN 

With the rise of Internet music distribution and the steady breakdown of the 
traditional record company distribution system, bands and individual artists 
have begun to produce, market and sell their own music on an ever-increasing 
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scale (Figure 19.5). This concept of the "grower" selling directly to the consumer 
is as oíd as the town square produce market. By using the global Internet 
economy, independent distribution, fanzines, live concert sales, etc., sawy inde- 
pendent artists are taking matters into their own hands (or are smart enough 
to combine with the talents of others) by learning the inner workings of the 
music business. In short, artists are taking business matters more seriously in 
order to reap the fruits of their labor and craft ... something that has never been 
and never will be an easy task. 

Beyond the huge tasks of marketing, gigging and general business practices, 
many musicians are also taking on the task of burning, printing, packaging and 
distributing their own CDs from the home or business workplace. This home- 
spun strategy allows for small runs to be made in an "on-demand" basis, 
without tying up financial resources and storage space in CD inventories. 

Creating a system for burning CD-Rs for distribution can range from being a 
simple home Computer setup that creates dises on an individual basis to sophis- 
ticated replication systems that can print and burn stacks of CD-Rs or DVD-Rs 
under robot control at the simple touch of a button (Figure 19.6). 
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CD Baby is a 
distribution Service 
that helps sell 
physical CDs and 
downloadable music 
over the Internet. 
(Courtesy of CD 
Baby, www.cdbaby. 
com.) 





















CD Labeling 



FIGURE 19.6 

Neato CD Labeler kit. 
(Courtesy of Neato 
LLC, www.neato. 
com.) 


■ Use a felt-tip pen: This is the easiest and fastest 
way to label a disc. However, water-based ink 
pens should be used, because most permanent 
markers use a solvent that can permeate the 
disc surface and cause damage to either the 
reflective or dye layer. When properly done, 
this is an excellent option for archiving dises. 

■ Use a label printing kit: "Stick-on" labels (Figure 
19.6) that have been printed using specially 
designed software and an ink-jet or láser 
printer are one of the least expensive options. 
Although their design has improved over the 
years, you should be aware that some adhe- 
sives could peel off, leak over time, or contain 
solvents that might adversely affect the disc. 
This professional-looking approach is often 
excellent for use on nonarchival producís. 

■ Use a CD-R/RW drive that can "burn" a CD label onto the print side of the 
CD: If you have access to a disc and drive that uses lightscribe (www. 
lightscribe.com) or a similar technology, you can inscribe monochrome 
text and/or graphics onto a CD face. 

■ Use a CD printer: Specially designed ink-jet or láser printers are able to 
print high-quality, full-color layouts onto the face of a printable (white or 
silver-faced) disc. This is a cost-effective option for those who burn dises 
in small batch runs, but still want a professional look and feel. 


CD LABELING 

Once you've burned your own CD-R/RW or 
DVD-R/RW, there are a number of options for 
printing labels onto the newly burned dises 
(burning the disc first will often reduce data 
errors that can be introduced by dust, finger- 
prints or scratches due to handling): 


Although stand-alone programs are available, most of the above-mentioned 
printing kits and CD printers inelude a label printing program for creating and 
printing professional-looking CDs, CD books and trays (as well as labels for 
other media types). These programs let you import graphics and position text 
to create and print out personalized, professional looking labels). In addition 
to these programs, word processing and graphics templates are often available 
(most often for MS Word and Adobe graphic programs) that let you print your 
own layouts in a professional manner. 


CD AND DVD HANDLING AND CARE 

Here are a few basic handling tips for CDs and DVDs (including the recordable 
versions) from the National Institute of Standards and Technology: 
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DO: 


■ Handle the disc by the outer edge or center hole (your fingerprints may 
be acidic enough to damage the disc). 

■ Use a felt-tip permanent marker to mark the label side of the disc. The 
marker should be water or alcohol based. In general, these will be labeled 
as a non-toxic CD/DVD pen. Stronger solvents may eat though the thin 
protective layer to the data. 

■ Keep dises clean. Wipe with a cotton fabric in a straight line from the 
center of the disc toward the outer edge. If you wipe in a circle, any 
scratches may follow the disc tracks, rendering them unreadable. Use a 
CD/DVD-cleaning detergent or isopropyl alcohol to remove stubborn dirt. 

■ Return dises to their cases immediately after use. 

■ Store dises upright (book-style) in their cases. 

■ Open a recordable disc package only when you are ready to record. 

■ Check the disc surface before recording. 

DON'T: 

■ Touch the surface of a disc. 

■ Bend the disc (because this may cause the layers to sepárate). 

■ Use adhesive labels (because they can unbalance or warp the disc). 

■ Expose dises to extreme heat or high humidity; for example, don't leave 
them in direct sunlight or in a car. 

■ Expose dises to extreme rapid temperature or humidity changes. 

■ Expose recordable dises to prolonged sunlight or other sources of ultra- 
violet light. 

ESPECIALLY DON'T: 

■ Scratch the label side of the disc (it's often more sensitive than the trans- 
parent side). 

■ Use a pen, pencil or fine-tipped marker to write on the disc. 

■ Try to peel off or reposition a label (it could destroy the reflective layer or 
unbalance the disc). 

DVD AND BLU-RAY BURNING 

Of course, on a basic level, DVD burning technology has matured enough to 
be available and affordable to the general Mac and PC public. From a technical 
standpoint, these CD-drive compatible dises differ from the standard CD format 
in several ways. The most basic of these are: 

■ An increased data density due to a reduction in pit size (Figure 19.7) 

■ Double-layer capabilities (due to the laser's ability to focus on two layers 
of a single side) 

■ Double-side capabilities (which again doubles the available data size). 

In addition to the obvious benefits that can be gained from increasing the data 
density of a standard CD from 650 Mbytes to a máximum of 17 Gbytes, DVD 
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FIGURE 19.7 

Detailed relief 
showing standard 
CD and DVD pit 
densities. 



dises allow for much higher data transfer rates, making DVD the ideal médium 
for the following applications: 

■ The simultaneous decoding of digital video and surround-sound audio 

■ Multichannel surround sound 

■ Data- and access-intensive video games 

■ High-density data storage. 

As of this writing, Blue-ray disc (BD) technology has established itself as the 
video delivery media du jour. (Of course, this might be oíd hat by the time you 
read this.) Because Blu-ray uses a shorter wavelength red láser (650 nm) and 
smaller pit densities than a DVD, the capacity has risen to 25 Gbytes on a single 
disc (or 50 on a dual-layer)! 

As DVD and BD-R/RW drives have become commonplace, affordable data 
backup and mastering software has come onto the market that brought the art 
of video and Hi-Def production to the masses. Even high-level DVD media 
mastering is now possible in a desktop environment, although creating a fin- 
ished product for the mass markets is often an art that's best left to profession- 
als who are familiar with the finer points of these complex technologies. More 
information on the finer points of codee data compression and media tech¬ 
nologies relating to both CD, DVD and BD technologies can be found in 
Chapter 10 (Multimedia and the Web). 

VINYL DISC MANUFACTURE 

Although the popularity of vinyl has waned in recent years (as a result, of course, 
of the increased marketing, distribution and public acceptance of the CD and 
Online distribution), the vinyl record is far from dead. In fact, for consumers 
that range from Dance DJ hip-hopsters to die-hard classical buffs, the record is 
making somewhat of a popular comeback. However, the truth remains that 
many record pressing facilities have gone out of business over the years, and 
there are far fewer mastering labs that are capable of cutting "master lacquers." 
It may take a bit longer to find a facility that fits your needs, budget and quality 
standards, but it's definitely not a futile venture. 
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Disc cutting 

The first stage of production is the disc-cutting process. As the master is played 
from a digital source or on a specially designed tape playback machine, its signal 
output is fed through a disc-mastering consolé to a disc-cutting lathe. Here the 
electrical signáis are converted into the mechanical motions of a Stylus and are 
cut into the surface of a lacquer-coated recording disc. 

Unlike the compact disc, a record rotates at a constant angular velocity, such as 
3 3 or 45 revolutions per minute (rpm), and has a continuous spiral that 
gradually moves from the disc's outer edge to its center. The recorded time 
relationship can be reconstructed by playing the disc on a turntable that has the 
same constant angular velocity as the original disc cutter. 

The system that's used for recording a stereo disc is the 45/45 system. The record¬ 
ing Stylus cuts a groove into the disc surface at a 90° angle, so that each wall of 
the groove forms a 45 ° angle with respect to the vertical axis. Left-channel signáis 
are cut into the inner wall of the groove and right-channel signáis are cut into 
the outer wall, as shown in Figure 19.8. The Stylus motion is phased so that L/R 
channels that are in-phase (a mono signal or a signal that's centered between the 
two channels) will produce a lateral groove motion (Figure 19.9a), while out- 
of-phase signáis (containing channel difference information) will produce a 
vertical motion that changes the groove's depth (Figure 19.9b). Because mono 
information relies only on lateral groove modulation, an older disc that has been 
recorded in mono can be accurately reproduced with a stereo playback cartridge. 

DISC-CUTTING LATHE 

The main components of a vinyl disc-cutting lathe are the turntable, lathe bed 
and sled, pitch/depth control Computer and cutting head. Basically, the lathe 
(Figure 19.10) consists of a heavy, shock-mounted Steel base (A). A weighted 
turntable (B) is isolated from the base by an oil-filled coupling (C), which 
reduces wow and flutter to extremely low levels. The lathe bed (D) allows the 
cutter suspensión (E) and the cutter head (F) to be driven by a screw feed that 
slowly moves the record mechanism along a sled in a motion that's perpen¬ 
dicular to the turntable. 

CUTTING HEAD 

The cutting head translates the electrical signáis that are applied to it into 
mechanical motion at the recording Stylus. The Stylus gradually moves in a 

outside edge oí disc ■ 


FIGURE 19.8 

The 45/45 cutting 
system encodes 
stereo waveform 
signáis into the 
grooves of a vinyl 
record. 
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FIGURE 19.9 

Groove motion in 
stereo recording. 
(The solid line is 
the groove with 
no modulation.) 

(a) in-phase; 

(b) out-of-phase. 



(b) 


FIGURE 19.10 

A disc-cutting lathe 
with automatic pitch 
and depth control. 
(Courtesy of Paul 
Stubblebine 
Mastering and 
Michael Romanowski 
Mastering, San 
Francisco, CA, www. 
paulstubblebine. 
com and www. 
michaelromanowski. 
com.) 



straight line toward the disc's center hole as the turntable rotates, creating a 
spiral groove on the record's surface. This spiral motion is achieved by attaching 
the cutting head to a sled that runs on a spiral gear (known as the lead screw), 
which drives the sled in a straight track. 

The stereo cutting head (Figure 19.11) consists of a Stylus that's mechanically 
connected to two drive coils and two feedback coils (which are mounted in a 
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FIGURE 19.11 

Simplified drawing 
of a stereo cutting 
head. 


permanent magnetic field) and a Stylus heating coil that's wrapped around the 
tip of the Stylus. When a signal is applied to the drive coils, an alternating 
current flows through them creating a changing magnetic field that alternately 
attracts and repels the permanent magnet. Because the permanent magnet is 
fixed, the coils move in proportion to a field strength that causes the Stylus to 
move in a plañe that's 45 ° to the left or right of vertical (depending on which 
coil is being driven). 

PITCH CONTROL 

The head speed determines the "pitch" of the recording and is measured by the 
number of grooves, or lines per inch (lpi), that are cut into the disc. As the head 
speed increases, the number of lpi will decrease, resulting in a corresponding 
decrease in playing time. Groove pitch can be changed by: 

■ Replacing the lead screw with one that has a finer or coarser spiral 

■ Changing the gears that turn the lead screw, so as to alter the lead screw's 
rotation speed 

■ Vary the lead screw's rotation by changing the motor's speed (a common 
way to vary the program's pitch in real-time). 

The space between grooves is called the land. Modulated grooves produce a 
lateral motion that's proportional to the in-phase signáis between the stereo 
channels. If the cutting pitch is too high (causing too many lines per inch, which 
closely spaces the grooves) and high-level signáis are cut, it's possible for the 
groove to break through the wall into an adjacent groove (causing overcut) or 
for the grooves to overlap (twinning). The former is likely to cause the record 
to skip when played, while the latter causes either distortion or a signal echo 
from the adj acent groove (due to wall deformations). Groove echo can occur 
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even if the walls don't touch and is directly related to groove width, pitch 
and level. 

These cutting problems can be eliminated either by reducing the cutting level 
or by reducing the lines per inch. A conflict can arise here as a louder record 
will have a reduced playing time, but will also sound brighter, punchier, and 
more present (due to the Fletcher-Munson curve effect). Because record com- 
panies and producers are always concerned about the competitive levels of their 
dises relative to those that are cut by others, they're reluctant to reduce the 
overall cutting level. 

The solution to these level problems is to continuously vary the pitch so as to 
cut more lines per inch during soft passages and fewer lines per inch during 
loud passages. This is done by splitting the program material into two paths: 
undelayed and delayed. The undelayed signal is routed to the lathe's pitch/depth 
control Computer (which determines the pitch needed for each program portion 
and varíes the lathe's screw motor speed). The delayed signal (which is usually 
achieved by using a high-quality digital delay line) is fed to the cutter head, 
thereby giving the pitch/depth control Computer enough time to change the lpi 
to the appropriate pitch. 

Pitch is divided into two categories: coarse (which refers to spacing between 96 
and 150 lpi) and microgroove (which is between 200 and 300 lpi, or more). 
Microgroove records have less surface noise, wider frequeney range, less distor- 
tion and greater dynamic range than do coarse-pitched recordings. They can 
also be tracked with lower Stylus pressure, resulting in a longer life; however, 
the Stylus is more likely to skate across the record if the turntable isn't level. 
The playback Stylus for a stereo microgroove record must have a tip radius of 
0.7 mil or less (compared to 2.5 mils ±0.1 for coarse-groove records). Older 
78-rpm and early 33^-rpm records were recorded with a coarse pitch; however, 
virtually all current records are microgroove (having an average pitch of 265 lpi). 
At máximum pitch, the playing time of one side of a 12-inch disc, with no 
modulation in the grooves, is about 23 to 26 minutes, while the duration of a 
variable-pitch 12-inch disc cut at average levels is about 45 minutes per side. 

Recording dises 

The recording médium used on the lathe is a fíat aluminum disc that's coated 
with a film of lacquer, which is dried under controlled temperatures, coated 
with a second film, and then dried again. The quality of these dises (called 
lacquers) is determined by the flatness and smoothness of the aluminum base. 
Any irregularities in this surface (such as holes or bumps) will cause similar 
defeets in the lacquer coating. Lacquers are always larger in diameter than the 
final record, which makes it easy to handle them without damaging the grooves. 
For example, a 12-inch álbum is cut on a 15-inch lacquer and a 7-inch single 
is cut on a 10- or 12-inch lacquer. As always, it's wise to cut a reference test 
lacquer in order to hear how the recording will sound after being transferred 
to disc. 
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The mastering process 

Once the mastering engineer sets a basic pitch on the lathe, a lacquer is placed 
on the turntable and compressed air is used to blow any accumulated dust off 
the lacquer surface. A chip suction vacuum is started and a test cut is made on 
the outside of the disc to check for groove depth and Stylus heat. Once the start 
button is pressed, the lathe moves into the starting diameter, lowers the cutting 
head onto the disc, starts the spiral and lead-in cuts, and begins playing the 
master production tape. As the side is cut, the engineer can fine-tune any 
changes to the previously determined consolé settings. Whenever an analog tape 
machine is used, a photocell mounted on the deck senses white leader tape 
between the selections on the master tape and signáis the lathe to automatically 
expand the grooves to produce track bands. After the last selection on the side, 
the lathe cuts the lead-out groove and lifts the cutter head off the lacquer. 

This master lacquer is never played, because the pressure of the playback Stylus 
would damage the recorded soundtrack (in the form of high-frequency losses 
and increased noise). Reference lacquers (also called reference acetates or simply 
acetates) are cut to hear how the master lacquer will sound. 

After the reference is approved, the record company assigns each side of the disc 
a master (or matrix) number that the cutting room engineer scribes between 
the grooves of the lacquers ending spiral. This number identifies the lacquer in 
order to eliminate any need to play the record, and often carries the mastering 
engineer's personal identity mark. If a disc is remastered for any reason, some 
record companies retain the same master numbers; others add a suffix to the 
new master to differentiate it from the previous "cut." 

When the final master arrives at the plating plant, it is washed to remove any 
dust particles and then electroplated with nickel. Once the electroplating is 
complete, the nickel píate is pulled away from the lacquer. If something goes 
wrong at this point, the master will be damaged, and the master lacquer must 
be recut. 

Vinyl disc plating and pressing 

The nickel píate that's pulled off the master (called the matrix) is a negative 
image of the master lacquer (Figure 19.12). This negative image is then electro¬ 
plated to produce a nickel positive image called a mother. Because the nickel is 
stronger than the lacquer disc, several mothers can be made from a single 
matrix. Since the mother is a positive image, it can be played as a test for noise, 
skips, and other defects. If it's accepted, the mother can be electroplated several 
times, producing stampers that are negative images of the disc (a final plating 
stage that's used to press the record). 

The stampers for the two sides of the record are mounted on the top and bottom 
plates of a hydraulic press. A lump of vinylite compound (called a biscuit) is 
placed in the press between the labels for the two sides. The press is then closed 
and heated by steam to make the vinylite flow around the raised grooves of the 
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FIGURE 19.12 

The various stages 
in the plating and 
pressing process. 
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stampers. The resulting pressed record is too soft to handle when hot, so coid 
water is circulated through the press to cool it before the pressure is released. 
When the press opens, the operator pulís the record off the mold and the excess 
(called flash) is trimmed off after the disc is removed from the press. Once done, 
the disc's edge is buffed smooth and the product is ready for packaging, distri- 
bution and sales. 

PRODUCING FOR THE WEB 

In this day of surfing and streaming media off the Web, it almost goes without 
saying that the WWW has become an important marketing tool for cost effec- 
tively getting downloadable songs, promotional materials, touring info and 
liner notes out to mass audiences. As with other media, mastering for the Inter¬ 
net can either be complicated, requiring professional knowledge and experi- 
ence, or it can be a straightforward process that can be carried out from a 
desktop Computer. It's a matter of meeting the level of professionalism and 
development that's required by the site. 

In this iPod world of MP3s, Windows Media, desktop video, Internet radio sta- 
tions and who-knows-what other types of streaming media, the rule that all 
cyber-producers live by is bandwidth. Basically, the bandwidth of a media and 
delivery/receiving system refers to the ability to squeeze as much data (offen 
compressed data) through a wire, wireless or optical pipeline in as short a time 
as possible. Transmitting the highest audio and/or video feed over a limited 
bandwidth will often require specialized (and often accessible) production 
tools. Beyond this, an even more important tool is mastery of the médium, mass 
marketing and good eyes and ears for design layout and media management. 
More info on streaming media and producing for cyber space can be found in 
Chapter 10 (Multimedia and the Web). 










CHAPTER 20 


Studio Tips and Tricks 


As we near the end of this book, I'd like to take some time to offer some tips 
that can help make a session go more smoothly, both in the project and in the 
professional studio environment. Of these, one of the most important insights 
to be gained (beyond an understanding of the technology and tools of the 
trade) is the fact that there are no rules for the process of recording. This rule 
holds true insofar as inventiveness and freshness tend to play a major role in 
keeping the Creative process of making music (and music productions) alive 
and exciting. There are, however, guidelines, equipment setup and procedures 
that can help you have a smoother flowing, professional-sounding recording 
session, or at the very least, help you solve potential problems when used in 
conjunction with four of the best tools for guiding you toward a successful 
project: 

■ Preparation 

■ Attention to detail 

■ Creative insight 

■ Common sense. 

PREPARATION AND THE DETAILS 

It's important to remember that the hallmark of both a good production and a 
good production facility is their investment in the details. Basically, the glory 
goes not so much to those who simply do the job, but to those who take the 
time to get the details right. OK, let's take some time to look at some of the 
details that can help your projects shine. Probably the most important step 
toward ensuring that a recording project will become successful and marketable 
is careful preparation and planning (Figure 20.1). By far, the biggest mistake that 
a musician or group can make is to go into the studio, spend a lot of money 
and time, press a few thousand CDs, make a témplate website ... and then sit 
back and expect an adoring audience to spring out of thin air! It ain't gonna 
happen! Beyond a good dose of business reality and added experience, the 
artist(s) will have the dubious distinction of joining the throngs that have thou- 
sands of CDs sittings in their closet or basement. 
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FIGURE 20.1 

WorkirT out the kinks 
beforehand in the 
coffee shop. 

(Courtesy of M-Audio, 
a división of Avid 
Technology, Inc., 
www.m-audio.com.) 



What’s a producer and when do you need one? 

One of the first steps that can help ensure the success of a project is to seek the 
advice and expertise of those who have extensive experience in their chosen 
fields. This might inelude seeking legal council (for help and advice with legal 
matters, business contacts or both). Another important "advisor" can come in 
the form of that all-important title, producer. Basically, the producer of a project 
can fill one of two roles: 

■ The first type can be likened to a film director, in that his or her role is to 
be an artistic, psychological and technical guide that can help the band 
or artists reach their intended goals of obtaining the best possible song, 
álbum, remix, film score, etc. It's the producer's job to stand back and 
objectively look at the big picture, to offer up suggestions as to how to 
shape and guide the performance and to direct the artist or group in direc- 
tions that would result in the best possible final product. 

■ The second type also encompasses the directorial role, but also has 
the added responsibilities of being an executive producer. He or she will 
be charged with many of the business responsibilities of overall session 
budgeting, making arrangements for all studio and session activities, 
contracting (should outside musicians and arrangers be needed on the 
project), etc. This type of producer may even join with a music lawyer 
and negotiate contact relations with potential record companies or 
distributors. 

From this, you can see that a producer's role can be either limited or broad in 
scope. His or her roles should be carefully discussed and agreed on long before 
any record button is pressed. The importance of finding a producer that can 
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work best with your particular personalities, musical style and business/ 
marketing needs can't be stressed enough, and finding the right match can be 
a rewarding experience. Here are a few tips to prepare you for the hunt: 

■ Check out the liner notes of groups or musicians that you love and admire. 
You never know—their producer just might be interested in taking you 
on! 

■ Find a local up-and-coming producer that might be right for your music. 
This could help fast-track your reputation. 

■ Talk with other groups or musicians. They might be able to recommend 
someone. 

Here are just a few of the questions to ask when searching out a producer: 

■ Does he or she openly discuss ideas and altérnate paths that contribute 
to growth and better artistic expression? 

■ Is he or she a team player, or are the rules laid out in a dictator-like 
fashion? 

■ Does the producer know the difference between a Creative endeavor and 
one that wastes time in the studio? 

■ Does he or she say "Why not?" a lot more often than "Why?" 

Although many engineers have spent most of their lives with their ears wide 
open and have gained a great deal of musical, production and in-studio experi¬ 
ence, it's generally not a good idea to assume that the engineer can fill the role 
of a producer. For starters, he or she will probably be unfamiliar with the group 
and their sound, or may not even appreciate or like them! For these and other 
reasons, it's always best to seek out a producer that is familiar with you, your 
goals and your style (or is contacted early enough in the game that he or she 
has time to become familiar). 

Do you need a music lawyer? 

It's important to realize that music in the modern world is a business. Once 
you get to the phase of getting your band or your client's band out to the buying 
public, you'll quickly realize just how true this is. Building and maintaining an 
audience with an appetite for your product can easily be a full-time business— 
one where you'll encounter well-intentioned people as well as some who would 
think nothing of taking advantage of you or your client. 

Whether you're selling your producís on the Street, at gigs, on MySpace or in 
the stores through a traditional music label, it's often wise to retain the counsel 
of a trusted music lawyer. The music industry is fraught with its own special 
legal and financial language, and having someone on your side who has insight 
into the language, quirks and inner workings of this unique business can be an 
extremely valuable asset. Before we move on, it should be pointed out that many 
metropolitan areas have "Lawyers for the Arts" organizations that regularly offer 
seminars and events, as well as one-on-one consultations with artists who are 
on a tight budget and have need of legal council. 
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U) LONG BEFORE GOING INTO THE STUDIO 

As was emphasized earlier in the chapter, one of the most important steps to 
take when approaching a project that involves a number of Creative and business 
stages, decisions and risks is preparation. Without a doubt, the best way to avoid 
pitfalls and to help get you, your client or your band's project off the ground is 
to discuss and outline the many factors and decisions that will affect the creation 
and outcome of that all-important "final product." Just for starters, a number 
of basic questions need to be asked long before anyone presses the "REC" 
button: 

■ How are you going to recoup the production costs? 

■ How is it to be distributed to the public? Self-distribution? Digital down- 
load? Indy? Record company? 

■ Will other musicians be involved and what will be the cost or financial 
arrangements? 

■ Do you need a producer or will you self-produce? 

■ How much practice will you need? Where and when? 

■ Should you record it in the drummer's project studio or at a commercial 
studio? 

■ If you use the project studio and it works out, should you mix it at the 
commercial studio? 

■ Who's going to keep track of the time and budget? Is that the producer's 
job ... or will he or she be strictly in charge of Creative and contact 
decisions? 

■ Are you going to need a music lawyer to help with contacts and contracts? 

■ When should the artist or group's artistic and financial goals be discussed 
and put down on paper? (Of course, it's always best to discuss budget 
requirements and possible rewards as early as possible in the game!) 

These are but a few of questions you should ask before tackling a project. Of 
course, they'll change from project to project and will depend on the final 
project's scope and purpose. However, in the final analysis, asking the right 
questions (or finding someone who can help you ask the right questions) can 
help keep you from having to store 1000 unsold CDs in your basement. 

Now that you've asked the questions, here's a to-do list of tasks that are often 
wise to tackle well before going into the studio: 

■ Create a "mission statement" for yourself/your group and the project. This 
can help clue your audience into what you are trying to communicate 
through your art and music, and can greatly benefit your marketing goals. 
For example, you might want to answer such questions as these: Who am 
I/who are we? What are your musical goals? How should the finished 
project sound? What emotions should it evoke? What is the budget for 
this project? How will it be sold? What are the marketing strategies? 

■ Practice, practice and more practice. Need I say more ... and while you're 
at it, you might want to record your practices to get used to the process 
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(some of these tracks could be used on your website as bonus tracks and 
for "making of" music videos). 

■ Start working on the project's artwork, packaging and website ASAP. Do 
you want to tackle this yourself, hire a professional or a qualified friend 
who wants to help or could use some extra $$$? 

■ Copyright your songs. Government forms are readily available for the 
copyrighting (identifying and protecting the intellectual property) of your 
music. Form PA is used for the registration of "published or unpublished 
works of the performing arts. This class ineludes works prepared for the 
purpose of being "performed" directly before an audience or indirectly 
"by means of any device or process." Works of the performing arts inelude 
(1) musical works, including any accompanying words; (2) dramatic 
works, including any accompanying music; (3) pantomimes and choreo- 
graphic works; and (4) motion pictures and other audiovisual works." ... 
In short, it is used to copyright a work that is intended for public perfor¬ 
mance or display. Form SR is used for the "registration of published or 
unpublished sound recordings. It should be used when the copyright 
claim is limited to the sound recording itself, and it may also be used 
where the same copyright claimant is seeking simultaneous registration 
of the underlying musical, dramatic, or literary work embodied in the 
phonorecord." ... In other words, it is used to copyright a recording, while 
also protecting the underlying performance that is recorded onto the 
media. These and other forms can be found at www.copyright.gov/forms 
or by searching the Library of Congress at www.loc.gov. Again, this might 
be a good time to discuss matters with a music lawyer. 

■ Should you wish to use the Services of a professional studio during the 
recording and/or mixdown phase, it's ALWAYS wise to take the time to 
check out several studios and available engineers. Take the time to listen 
to tracks that have come out of the studio, as well as those that have been 
recorded by the engineer. Finding out which one best fits your style, 
budget and level of professionalism is an important and potentially 
expensive decisión ... the time taken to find the best match could be the 
difference between a happy or potentially disastrous experience. 


BEFORE GOING INTO THE STUDIO 

Before beginning a recording session (possibly a week or more before), it's 
always good to mentally prepare yourself for what lies ahead by creating a basic 
checklist that can help answer what type of equipment will be needed, the 
number and type of musicians/instruments, their particular miking technique 
(if any) and where they'll be placed. The best way to do this is for you, your 
group and the producer (if there is one) to sit down with the engineer and 
discuss instrumentation, studio layout, musical styles and production tech- 
niques. This meeting lets everyone know what to expect during the session and 
lets everyone become familiar with the engineer, studio and staff. This is time 
well spent, because it will invariably come in handy during the studio setup 
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and will help get the session off to a good start. The following tips can also be 
immensely valuable: 

■ If there's to be no producer on the project, it's often wise to pick (or at 
least consider picking) a spokesman for the group who has the best 
production "chops." He or she can then work closely with the engineer 
to create the best possible recording and/or mix. 

■ Record your songs during live gigs or rehearsals. It doesn't matter if you 
record them professionally or not; however, keeping the "always press the 
record button" adage in mind, if the setup meets basic professional stan- 
dards, it's always possible to import all or part of a "magical" take into 
the final project. 

■ You might want to audition the session's song list before a live audience. 

■ Work out all of the musical and vocal parís before going into the studio. 
Unrehearsed music can leave the music standing on shaky ground; 
however, leave yourself open to exploring new avenues and surprises that 
can be the lifeblood of a magical session. 

■ Try to leave plenty of time for laying down the final vocal tracks. Many a 
project has been compromised by spending too much time on "tracking" 
the basic instruments and then running short on time to lay down the 
vocals. This almost always leads to increased tensions and a compromised 
vocal track. 

■ Rehearse more songs than you plan to record. You never know which 
songs will end up sounding best or will have the strongest impact. 

■ Again, meet with the engineer beforehand. Take time for the producer 
and/or group to get to know him or her, so you'll both know what to 
expect on the day of the session. 

■ Prepare and edit any sequenced, sampled or prerecorded material before¬ 
hand. In short, be as prepared as possible. 

■ Try working to a metronome (click track) if timing is an issue. 

■ Make sure that the instruments are in good working condition, bring new 
strings, etc. 

■ Create a checklist of all of the small but important details that can make 
or break a session, for example, extensión cords, tuners, instrument cords, 
drum oil, drum tuning lugs, your favorite good luck charm, comfortable 
jammies ... you ñame it! 

■ While we're on the subject of small, but important details, make sure to 
bring cables that are in good working order ... and don't forget extras. 

■ Take care of your body. Try to relax and get enough sleep before and 
during the session. Eat the foods that are best for you (you might bring 
some health foods, fruits and plenty of liquids to keep your energy up). 
Be aware of your energy levels, so that hypoglycemia doesn't become 
a factor. 

■ Don't fatigue your ears before a session; keep them rested and clear. 

In addition to the above, it's always wise to plan out the session's technical and 
musical arrangement so as to most efficiently budget your time. 
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Again, it's always wise to confer with the producer and/or engineer about the 
way in which the musicians are to be tracked. Will the tracks be recorded in a 
traditional multitrack fashion (with the basic rhythm tracks being laid down 
first, followed by overdub and sweetening tracks and finishing with vocals) ... 
or is there a layout that would work best for your particular group's taste and 
organizational way of doing things? Communicating these details to all those 
involved in the session will help smooth the studio setup process and keep a 
basic game plan that can help keep the session reasonably on track. 

©SETTINGUP 

Once the musicians have shown up at the studio, it's extremely important that 
all of the technical, musical, and emotional preparation be put into practice in 
order to get the session off to a good start. Here are a few tips that can help: 

■ Show up at the studio on time or reasonably early. At some studios, the 
billing dock starts on time (whether you're there or not). Ask about their 
setup policies: Is there another session before yours? Is there adequate 
setup time to get prepared? Are there any charges for setup? What is the 
studio's cancellation policy in case of illness or unforeseen things go 
wrong? 

■ Use new strings, chords, drumsticks and heads, and bring spares. It's also 
a good idea to know the location, phone number and hours of a local 
music store, just in case. 

■ Tune up before the session starts, and tune up regularly thereafter. 

■ Don't use new or unfamiliar equipment (musical, hardware-wise or espe- 
cially software-wise). Taking the time to troubleshoot or become familiar 
with new equipment and software can cost you time and money. The 
frustration could even result in a lost vibe, or worse! If you must use a 
new toy or tool, it's best to learn it beforehand. 

■ Take the time to make the studio a comfortable place in which to work. 
You might want to adjust the lighting to match your mood ring, lay down 
a favorite rug and/or bean bag, turn on your lava love light or put your 
favorite stuffed toys on the furniture. Within reason, the place is yours to 
have fun with! 

©SESSION DOCUMENTATION 

There are few things more frustrating than going back to an archived session 
and finding that no information exists as to what instrument patch, mic type, 
or outboard effect was used on a DAW session (or any session, for that matter). 
The importance of documenting a session in a sepárate written document or in 
the notepad apps within a DAW can't be overemphasized. The basic documen- 
tation that relates to the who, what, where and when of a recording, mixdown, 
mastering and duplication session should inelude such information as: 

■ Artists, engineers and support staff who were involved with the project 

■ Session calendar dates and locations 
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■ Session tempo 

■ Participants in the project and important dates (for future credits) 

■ Mic choice and placement (for future overdub reference) 

■ Outboard equipment types and their settings 

■ Plug-in effects and their settings or general descriptions (you never know 
if they'll be available at a future time, so a description can help you to 
duplicate it with another app). 

To ease this process, the artist or producer might pulí out a camera or camera- 
phone and start snapping pictures to document the event for prosperity as 
well as for technical reasons. Also, it's often a wise idea to bring along a high- 
quality video camera to quietly document the setup or actual session for your 
fans as extra "behind-the-scenes" online video content such as MySpace or 
YouTube. 

The more information that can be archived with a session (and its backups), 
the better the chance that you'll be able to duplicate the session in greater detail 
at some point in the future. Just remember that it's up to us to save and docu¬ 
ment the music of today for the fans and future playback/mix technologies of 
tomorrow. Basic session documentation guidelines can be found in the Guide- 
lines & Recommendation section at www.grammy.com/Recording_Academy/ 
Producers_and_Engineers. 

©RECORDING 

It's obviously a foregone conclusión that no two recording sessions will ever be 
exactly alike. In fact, in keeping with the "no rules" rule, they're often radically 
different from each other. During the recording session, the engineer watches 
the level indicators and (only if necessary) Controls the faders to keep from 
overloading the media. It's also an engineer's job to act as another set of pro- 
duction ears by listening for both performance and quality factors. If the pro¬ 
ducer doesn't notice a mistake in the performance, the engineer just might catch 
it and point it out. The engineer should try to be helpful and remember that 
the producer and/or the band will have the final say, and that their final judg- 
ment of the quality of a performance or recording must be accepted. 

From the musician's standpoint, here are a few additional pointers that can help 
the session go more smoothly: 

■ It's always best to get the right sound and vibe onto tape or disk during 
the session. If you need to do another take, do it! If you need to change 
a mic, change it. Getting the right sound and vibe onto the track will 
almost always result in less stress and a better final product than trying to 
"Fix in the mix" at a later time. 

■ Know when to quit! If you're pushing too hard or are tired, it'll often 
show. 

■ Technology doesn't always make a good track; feeling, emotion and musi- 
cality do. 
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■ Beware of adding new parís or tracks onto a piece that doesn't need it. 
Remember, too much is simply too much! Musicians and techno-geeks 
alike often don't know when to say "it's done ... let's move on." 

■ Leave plenty of time for the vocal track(s). It's not uncommon for a group 
to spend most of their time and budget on getting the perfect drum or 
guitar sound. It takes time and a clear focus to get the vocals right. 

■ If you mess up on a part, keep going, you might be able to fix the bad 
part by punching-in. If it's really that bad, the engineer or producer will 
hopefully stop you. 

In his EQ Magazine article "The Performance Curve: How Do You Know Which 
Take Is the One?" my buddy Craig Anderton laid out his experiences of how 
different musicians will deal with the process of delivering a performance over 
time. Being in front of a mic isn't always easy, and we all deal with it differently. 
Here's a basic outline of his findings: 

■ Curves up ahead: With this type of performer, the first couple of takes are 
pretty good, then start to go downhill before ramping back up again, until 
they hit their peak before going downhill really fast. 

■ The quick starter: This type starts out strong and doesn't improve over time 
in later performances. Live performers often fall into this category, because 
they're conditioned to get it right the first time. 

■ The long ramp-up: These musicians often take a while to warm up to a 
performance. After they hit their stride, you might get a killer take or a 
few great ones that can be composited together into the perfect take. 

■ Anything goes: This category can vary widely within a performance. Often, 
snippets can be taken from several takes into a single composite. You want 
to record everything with this type of performer, because you just never 
know what gem (or bad take) you'll end up with. 

■ Rock steady: This one represents the consummate pro who is fully practiced 
and delivers a performance that doesn't waver from one take to the next; 
however, you might record several takes to see which one has the most 
feeling. 

From the above examples, we can quickly draw the obvious conclusión that 
there are all types of performers, and that it takes a qualified and experienced 
producer and/or engineer to intuit just which type is in front of the mic—and 
to draw the best possible performance from him or her. 


6 MIXDOWN 

Many of the same rules of preparation and taking good care of your ears and 
yourself apply during the session's mixdown phase. Here are a few tips: 

■ Regarding monitoring, it's often a good idea to use reference monitors 
that you trust. As such, it's common practice for a mix engineer and/or 
artist to request their favo rite speakers or to bring their own into the studio 
for the final mix. 
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■ Unlike during the 1970s, when excruciatingly high SPLs tended to be the 
rule in most studios, recent decades have seen the reduction of monitor 
levels to a more modérate 75 to 90 dB SPL. A good rule of thumb is that 
if you have to shout to communicate in a room you're probably monitor- 
ing too loudly. From time to time, you can always jack it up to 11 to check 
the mix at higher volumes. 

■ Listen on several speaker types ... at home, in your car, on your iPod/ 
phone. Jot down any thoughts and comments that might come in handy, 
should you need to go back and make adjustments. 

® BACKUP AND ARCHIVE STRATEGIES 

The phrase "nothing lasts forever" is especially true in the digital domain of lost 
l's and O's, damaged media, dead hard drives and lost data ... you know, the 
"Oh @#$%! factor." It's a basic fact that you never quite know what lies around 
the techno bend—and, of course, it's extremely important that you protect 
yourself as much as is humanly possible against the inevitable. Of course, the 
answer to this digital dilemma is to back up your data in the most reliable (or 
redundant) way possible. Hardware and program software can (usually) be 
replaced; on the other hand, when unbacked valuable session soundfiles are 
lost, they're lost! 

Backing up a session can be done in several ways, depending on the level of 
certainty that the files can be played back in the future with a minimum of 
hassle. Here are a few tips that can help you avoid the loss of data: 

■ As you might expect, the most straightforward backup system is to copy 
the session data, in its entirety, to the most appropriate media. 

■ In the longer run (5+ years), the most ironclad way to back up the track 
data of a session is to print each track as its own .wav or .aif file. Each 
track should always be recorded or exported as a contiguous file that flows 
from the beginning of the session (00:00:00:00 or appropriate beginning 
point) to the end of that particular track. In this way, the individual track 
files could be loaded into any type of DAW, at the beginning point, for 
Processing and mixdown. 

■ In such a track-by-track safety restoration situation, you might want to 
save two copies of a track that has a particular effect—one that contains 
the original, unaltered sound and one that contains the effected signal. 

■ Those who want additional protection against the degradation of unproven 
digital media may also want to back each track (or group of tracks) to the 
individual tracks of a multitrack analog recorder. 

■ For those sessions that contain MIDI tracks, you should always keep these 
tracks within the session (i.e., don't delete them). These tracks might come 
in very handy during a remix or future mixdown. 

■ Speaking of MIDI ... it's always wise to export all of the MIDI tracks/data 
within a session as a standard MIDI file. You should save all of the tracks 
as a type 1 file (where all of the multichannel track/data information is 
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left intact) and, whenever possible, save it as both a type 0 and a type 1 
(you never know what file format obstacles might haunt you in the 
future). 

■ Whenever possible, make múltiple backups and store them in sepárate 
locations. Having a backup copy in your home as well as in the studio 
can save your proverbial butt in case of a fire or any other unforeseen 
situation. Remember the general backup rule of thumb: Data is never truly 
backed up unless it's saved in three places! 

©HOUSEHOLD TIPS 

Producers, musicians, audio professionals and engineers spend a great deal of 
time in the control room and studio. It only makes sense that this environment 
should be laid out in a manner that's aesthetically, functionally and acoustically 
pleasing from a feng shui point of view. Creating a good working environment 
that's conducive to making good music is the goal of every professional and 
project studio owner. Beyond the basics of creating a well-designed facility from 
an acoustic and electronic standpoint, a number of basic concepts should be 
kept in mind when building or designing a recording facility, no matter how 
grand or humble. Here are a few helpful hints: 

■ Given the fact that an engineer spends a huge amount of time sitting on 
his or her bum, it's always wise to invest in both you and your clients' 
posture and creature comforts by having comfortable, high-quality chairs 
around for both the production team and the musicians (Figure 20.2). 

■ Velero™ or tie-straps can be used to organize studio wiring bundles into 
groups that can be laid out in a way that reduces clutter, improves orga- 
nization (color-coded straps can be used) and makes the studio look more 
professional. 


FIGURE 20.2 

The venerable 
Hermán Miller 
Aeron® chair. 
(Courtesy of Hermán 
Miller, Inc.; www. 
hermanmiller.com.) 
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■ Most of us are guilty of cluttering up our workspace with unused gear, 
papers ... you know, junk! I know it's hard, but a clean, uncluttered 
working environment tells your clients a lot about you, your facility and 
your work habits. 

■ Unused cables, adapters and miscellaneous stuff can be sorted into storage 
boxes and stacked for easy storage. 

■ Important tools and Ítems that are used every day (such as screwdrivers, 
masking tape or markers) can be stored in a rack-mounted drawer that 
can be easily accessed without cluttering up your space ... don't forget to 
pack a reliable LED flashlight (your phone's screen display might also 
work in a pinch). 

■ Portable label printers can be used to identify cable runs within the studio, 
identify patch points, I/O strip instrumentation ... you ñame it. 


(£) MUSICIANS’ TOOLS 

By now it's probably painfully obvious to most musicians that producing the 
music is only the first step toward building a career in the business. It takes hard 
work, perseverance, blood, sweat, tears and laughter. For every person who 
makes it, a large number don't. There are a lot of people waiting in line to get 
into what is perceived by many to be a glamorous biz. So, how do you get to 
the front of the line? Well, folks, here are some keys to help you through the 
golden gate. 

■ A ton of self-motivation 

■ Good networking skills 

■ A good, healthy attitude 

■ The realization that "showing up is huge!" 

The business of art (the techno-arts of recording and music production being 
no exception) is one that's generally reserved for self-starters. Even if you get a 
degree from XYZ College or recording school, there's absolutely no guarantee 
that a label will be knocking on the door with a contract in hand (if they do, 
get a lawyer, quick!). It takes a large dose of perseverance, talent and personal- 
ity to make it. In fact, one of the best ways to get into the biz is to get down on 
your knees and knight yourself on the shoulder with a sword (figuratively or 

literally—it doesn't matter) and say: "I am now a_!" Whatever it is 

you want to be, simply become it ... Shazammm! Make up a business card, 
start a business, begin contacting artists to work with or make the first step 
toward becoming the artist you want to be. 

There are many ways to get to the top of your own personal mountain. You 
could get a diploma from a school of education or the school of hard knocks 
(it usually ends up being from both), but the goals and the paths are up to you. 
Like a mentor of mine says: "Failure isn't a bad thing ... not trying is!" 

Another part of the success equation lies in your ability to network with other 
people. Like the venerable expression says: "It's not [only] what you know ... 
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it's WHO you know." Maybe you have an únele or a friend in the business, or 
a friend who has an únele—you just never know where help might come from 
next. This idea of getting to know someone who knows someone else is what 
makes the business world go around. Don't be afraid to put your best face 
forward and start meeting people. If you want to play gigs around your región 
(or beyond), get to know a promoter or venue manager and hang out without 
being in the way. You never know—the music maven down the Street might 
know someone who can help get you in the proverbial door. The longer you 
stick with it, the more people you'll meet, thus making a bigger and stronger 
network than you thought could be possible. 

Like a cióse buddy of mine always says, "Showing up is huge!" It's the wise 
person who realizes that being in the right place at the right time means being 
at the wrong place hundreds and hundreds of times. You just never know when 
lightning is going to strike—just try to be standing in the right field when it 
does. 

Here are some more practical and immediate tips for musicians: 

■ Build a personal and/or band website: Making a site on www.MySpace.com 
or creating your own personal site helps to keep the world informed of 
your gigs, projeets, bio and general goings-on. 

■ Build a relationship with a music lawyer: Many music lawyers are open to 
building relations that can be kicked into gear at a future time. Take the 
time to find a solicitor who is right for you. Does he or she understand 
your personal music style? If you don't have the bucks, is this person 
willing to work with you and your budget, as your career grows? 

■ The same questions might be asked of a potential manager. This symbiotic 
relationship should be built with care, honesty and safeguards (which is 
just one of the many reasons you want to know a music lawyer). 

■ Copyright your music: Always protect your music by registering it with the 
Library of Congress. It's easy and inexpensive and can give you peace of 
mind about knowing that the artistic property that you're sending out into 
the world is protected. Go to www.copyright.gov for more information 
(www. copyright.gov/ forms). 

■ On a personal note as a musician, I've come to realize that making music 
is about the journey—not the goal of being a star, or being the big man 
on campus. It's about the friendships, the collaborations, good and bad 
times at gigs.... To me, it's about making music. 


QS> RECORD YOUR OWN CONCERTS AND PRACTICES 

As a musician, I've found that recording my concerts and putting them up on 
the Web helps in many ways. 

■ Freely distributing concerts on your site helps to promote your music 
and provides a degree of goodwill that can go a long way with your 
fans. 


Protect Your Investment 


■ These recordings can be really helpful as a business card promo tool, in 
that a link can be sent to potential venues, allowing booking agents to 
hear and appreciate your music firsthand. 

■ These recordings can also be helpful as learning tools for the band and 
yourself. In general, they don't lie, and can help to point out shortcomings 
in your performance. 

This process doesn't have to be involved. For example, connecting a portable 
recorder to your audio interface or house mixer and hitting the record button 
before going on stage will often do the trick. 

© PROTECT YOUR INVESTMENT 

When you've spent several years amassing your studio through hard-earned 
sweat-equity and bucks, it's only natural that you'll want to take the necessary 
precautions to protect your investment. 

Obviously, the best way to protect your data is through a rigorous and straight- 
forward backup scheme (the general rule is that something isn't backed up 
unless it's saved in three places—preferably with one of the backups being 
stored off-site). However, you'll also want to take extra steps to protect your 
hardware and software investments as well, by making sure that they're properly 
insured. 

The best way to start the process of properly insuring your studio is to contact 
your trusted insurance agent or broker. ... If you don't have one, now's the time 
to get one. You might get some referrals from friends or people in your area 
and give them a cali, set up some appointments and get several quotes. 

If you haven't already done so, sit down and begin listing your equipment, their 
serial numbers and replacement valúes. Next, you might consider taking pie- 
tures or a home movie of your listed studio assets. These steps will help your 
agent come up with an adequate replacement plan and will come in handy 
when filing a claim, should an unfortunate event occur. Being prepared isn't 
just for the Boy or Girl Scouts. 

© PROTECT YOUR HARDWARE 

One of the best ways to ensure against harmful line voltage fluctuations (both 
above and below their nominal power levels) is to use an adequately powered 
uninterruptible power supply (UPS). In short, a quality UPS works by using a 
regulated power supply to constantly charge a rechargeable battery or bank of 
batteries. This battery supply is again regulated and used to feed sensitive studio 
equipment (such as a Computer, bank of effeets devices, etc.) with a clean and 
constant voltage supply. 

Just as the "uninterruptible" part of the ñame implies, should the power be 
momentarily interrupted or give out altogether, a good UPS can draw on its 
battery supply to see you through a momentary power loss or to give you 
enough time to safely shut your system down without losing data during a total 
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power failure. The biggest concern here is to make sure that you buy a UPS that 
has enough power reserves to adequately and continuously power the equip- 
ment during normal operation, and will give you enough time to save your 
session and/or file data and then safely shut the system down during an outage. 
In short, make sure the UPS has a continuous power rating that's high enough 
for your supply needs. 

0 UPDATE YOUR SOFTWARE 

Periodic software updates might help to solve some of those annoying problems 
that you've been dealing with in the studio. Many times, the software that's been 
pressed onto a commercially available CD or DVD will be out of date by the 
time it reaches you. For this reason, it's good to check the Web regularly to see 
if there's a newer versión that can be loaded at the outset or periodically over 
the course of the software's life. 

0 A WORD ON PROFESSIONALISM 

Before we cióse this chapter, there's one more subject that I'd like to touch on— 
perhaps the most important one of all— professional demeanor. Without a doubt, 
the life and job of a typical engineer, producer or musician isn't always an easy 
one. It often involves long hours and extended concentration with people who, 
more often than not, are new acquaintances. In short, it can be a high-pressure 
job. On the flip side, it's one that's often full of new experiences, with demands 
that change on almost a daily basis, and often involves you with exciting people 
who feel passionately about their art and chosen profession. 

It's been my observation (and that of many I've known) that the best qualities 
that can be exhibited by anyone in "The Biz" are: 

■ Having an innate willingness to experiment 

■ Being open to new ideas (flexibility) 

■ Having a sense of humor 

■ Having an even temperament (this often translates as patience) 

■ Being willing to communicate with others 

■ Being able to convey and understand the basic nuances of people from 
all walks of life and with many different temperaments. 

The best advice I can possibly give is to be open , be patient and above all , be your- 
self. Also, be extra patient with yourself. If you don't know something, ask. If 
you made a mistake (trust me, you will; we all do), admit it and don't be hard 
on yourself. It's all part of the process of learning and gaining experience. 

This last piece of advice might not be as popular as the others, but it may come 
in handy someday: It's important to be open to the fact that there are many, 
many aspects to music and sound production, and you may find that your 
calling might be better served in another branch of the biz. That's totally OK! 
Change is an important part of any Creative process—that and taxes are the only 
constants you can count on! 


In Conclusión 


D I Y 


do it yourself 


Tutorial: Handling a Successful Session 


Your DIY: you can totally do whatever you want, 
whatever sounds good to you and what’s best for the 


session, the artist and the musió ... and that’s a very 
liberating thing! 


IN CONCLUSION 


Obviously, the above tips are just part of an ever-changing list. The process of 
producing, recording and mixing in any type of studio environment is an 
ongoing, lifelong pursuit. Just when you think you've gotten it down, the tech- 
nology or the nature of the project changes under your feet—hopefully, you'll 
be the better for it and will learn from the process. Far more than just the tech- 
nology, the process of coming up with your own production style and your own 
way of applying the tools, toys and techniques to a production is what makes 
an artist—whether you're in front of the proverbial glass or behind it. Over time, 
your own list of studio tips and tricks will grow. Take the time to write them 
down and pass them on to others, and be open to the advice of your friends 
and colleagues. Use the trade mags, conventions and the Web to lead you to 
new ideas. This way, you're opening yourself up to new insights to using the 
tools of your profession and to finding new ways of doing stuff. Learning is an 
ongoing process ... have fun along the way! 


CHAPTER 21 


Yesterday p Today 
and Tomorroui 


Im sure you've heard the phrase "Those were the good oíd days." I've usually 
found it to be a catch-all that refers to a time in one's life that had a sense of 
great meaning, relevance and all-around fun. Personally, I've never met a group 
of people who seem to bring that sense of relevance and fun with them into 
the present more than music and audio professionals, enthusiasts and students. 
The fact that many of us refer to the tools of our profession as "toys" says a lot 
about the way we view our work. Fortunately, I was born into that clan and 
have reaped the benefits all my life. 

Music and audio industry professionals, by necessity, tend to keep their noses 
to the workaday grindstone. But market forces and personal visions often cause 
them to keep one eye focused on future technologies, whether these are new 
developments (such as advances in digital technologies), rediscovering retro 
trends that are decades oíd (such as the reemergence of tube technology and 
the reconditioning of older devices that sound far too good to put out to 
pasture), or future technologies that excite the imagination. Such is the time 
paradox of a music and audio professional, which leads me to the book's final 
task: addressing the people and technologies in the business of sound recording 
in the past, present and future. 

YESTERDAY 

I've always looked at the history of music and sound technology with a sense 
of awe and wonder, although I really can't explain why. Like so many in this 
industry, I tend to get shivers when I see a wonderful oíd tape machine or an 
original tube compressor. For no reason whatsoever, I get all giggly and woozy 
when I read about some of the earlier consoles that were used to record indus¬ 
try greats (Figure 21.1) or see an original Ampex 200 (the first commercially 
available professional tape machine). I experience the same sense of awe when 
I read about my personal historical heroes such as Alan Dower Blumlein (Figure 
21.2), who was instrumental in developing stereo mic techniques, the 45°/45° 
stereo disc-cutting process, the TV camera and radar. To many, his list of accom- 
plishments is second only to those of Edison. Mary C. Bell (Figure 21.3) who 
was probably the first woman sound engineer also comes to mind, along with 
another unsung hero, the late John (Jack) T. Mullin (Figure 21.4), who stumbled 
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FIGURE 21.1 

Bill Putnam prototype 
mixing consolé at 
Universal Recording, 
Chicago, 1957. 
(Courtesy of 
Universal Audio, 
www.uaudio.com.) 
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FIGURE 21.2 

The lite of Alan 
Dower Blumlein 
(truly, one of my 
long-time heroes) 
has been published 
by Focal Press. 
(Courtesy of Focal 
Press, www. 
focalpress.com.) 


across a couple of Germán magnetophones at the end of WWII 
and was smart enough to send them back to San Francisco. With 
the help of Alexander M. Poniatoff (founder of "AMP"ex) and Bing 
Crosby, Jack and his machines played a crucial role in bringing the 
magnetic tape recorder into commercial existence. 

Every once in a techno-blue moon, major milestones come along 
that affect almost every facet of information and entertainment 
technology. Such milestones have ushered us from the Edison and 
Berliner era of acoustic recordings, into the era of broadcasting, 
electrical recording and tape, to the environment of the multitrack 
recording studio (Figures 21.5 and 21.6), and finally into the age 
of the Computer, digital media and the Web. 

When you get right down to it, the foundation of the modern 
information and digital age was laid with the invention of the 
integrated Circuit (IC). The IC has likewise drastically changed the 
technology and techniques of present-day recording by allowing 
circuitry to be easily designed and mass produced at a fraction of the size and 
cost of equipment that was made with tubes or discrete transistors. 

Advances in digital hardware and software have brought about new develop- 
ments in equipment and production styles that have affected the ways in which 
music is created. Integrating cost-effective yet powerful production computers 
with digital mixing systems, modular digital multitracks, MIDI synths/samplers, 
plug-in effects and instruments, digital signal processors, etc., gives us the recipe 
for having a powerful production studio in our homes, bedrooms, or on the 
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FIGURE 21.3 

Mary C. Bell in NBC’s 
dubbing room #1 
(April, 1948) 
inspecting broadcast 
lacquer dises for 
on-air programs. 
(Courtesy of Mary C. 
Bell.) 



FIGURE 21.4 

John (Jack) T. Mullin 
(pictured left) with 
the two original 
Germán magneto- 
phones. (Courtesy of 
the late John [Jack] 
T. Mullin.) 


bus. Such laptop and desktop music studios have made it possible for more 
and more people to create and distribute their own music with an unprece- 
dented degree of ease, quality and cost effectiveness. 

Peter Gotcher (Digidesign cofounder) was one of the first to envision the cre- 
ation of a cost-effective "studio-in-a-box" (Figures 21.7 and 21.8). This concep¬ 
tual spark, which started a present-day goliath, helped to create a system that 
would offer the power of professional hard-disk-based audio at a price that most 
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FIGURE 21.5 

Gilfoy Sound Studios, 
Inc., circa 1972. 
Notice that the room 
is set up for quad! 
(Courtesy of Jack 
Gilfoy, www. 
jackgilfoy.com.) 



FIGURE 21.6 

Early 1980s at the LA 
Record Plant, Studio 
C. Pictured left to 
right: Tom Dowd 
(producer seated at 
consolé), Jimmy 
Pankow and Peter 
Ceera of “Chicago” 
in rear. (Courtesy of 
the Record Plant 
Recording Studios. 
Photo credit: Diane 
Nini.) 
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FIGURE 21.7 

“Sound Tools,” the 
first cost-effective 
digital audio 
workstation, 
was released by 
Digidesign in the late 
1980s. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 



FIGURE 21.8 

Digidesign’s Control 
console/Pro Tools- 
based recording 
system. (Courtesy of 
Digidesign, a división 
of Avid Technology, 
Inc., www.digidesign. 
com.) 


music, audio and media producers could afford. (It's important to realize that 
previous to this, digital Systems started at over $100,000!) His goal (and those 
of countless others since) has been to create an integrated system that would 
link together the many facets that go into audio and audio-for-visual produc- 
tion, via a personal Computer. Years later, this dream has totally transformed 
music and audio production. 

One of the cooler by-products of this digital age is an upsurge in "retro- 
future" trends in music and technology that blend together older and newer 
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technologies. With this has come an increased interest and awareness in 
museums and websites that help us to learn about our musical and 
technological roots throughout music production history. A few of the physical 
and cyber sites that can help give us a better understanding of our roots 
inelude: 

Museums: 

■ Computer History Museum 
1401 N. Shoreline Boulevard 
Mountain View, CA 94043 
(650)810-1010 

info @ computerhistory. org 

■ Experience Music Project 

325 5th Avenue N. (in the Seattle Center, near the Space Needle) 

Seattle, WA 98109 
877.EMPLIVE; (877)367-5483 
www.empsfm.org 

■ Grammy Museum 

800 W. Olympic Boulevard, Suite A245 
Los Angeles, CA 90015 
www.grammymuseum.com 

■ Rock and Roll Hall of Fame and Museum 
1100 Rock and Roll Boulevard 
Cleveland, Ohio 44114 
(216)781-ROCK 
www.rockhall.com 

■ Stax Museum of Soul Music 
926 E. McLemore Avenue 
Memphis, TN 38106 
(901)946-2535 
www.soulsvilleusa.com 


Sites: 


■ Audio Engineering Society Historical Committee (www.aes.org/aeshc) 

■ Synthmuseum.com (www.synthmuseum.com) 

■ Tinfoil.com (www.tinfoil.com) 

When it comes to understanding the tools, toys and techniques of our trade, 
I've always felt that there are a lot of benefits to be gained from looking back 
into the past ... as well as by gazing into the future. A wealth of experience in 
design and application has been laid out for us. It's simply there for the taking; 
all we have to do is search it out and put it to good use. 

TODAY 

"Today" is a really difficult subject to talk about, since new equipment literally 
comes out on a monthly basis. However, it safely goes without saying that the 
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two buzz terms that best sum up the dawn of the 21st century are digital audio 
and the Web. 

l’s and O’s 

The grouping of digital Ts and O's into words that represent alphanumeric 
valúes, sampled voltage levels over time (digital audio) or pixilated color and 
brightness valúes on a screen (digital graphics) have changed Communications 
and Creative production forever. Literally, even the most diehard analog fanatic 
can't escape its far-reaching grasp. 

With the advent of the personal Computer, DAW, cost-effective peripherals, 
application and effects plug-ins and digital downloads, music creation isn't only 
more cost effective than ever before—it also offers a degree of power, flexibility 
and portability that literally boggles the mind! 

Fortunately, this newfound digital technology has spawned a primal urge, 
almost a frenzied lust to marry cutting-edge hardware/software with vintage gear 
or newly reissued toys that are based on decades-old technologies, particularly 
tube devices (such as tube condenser mies, preamps, and signal processing 
gear). So why embrace this older technology? Well, for starters, tube electronics 
inherently have a sound that's very different than their IC- or transistor-based 
counterparts. When a tube is overdriven to the point of clipping, the square 
edge of the distorted signal is actually rounded off. This tends to yield a 
smoother, fuller, more "rounded" distortion, when compared to a sharply dis¬ 
torted, harsh edge that's usually exhibited by transistor and IC Circuit designs. 
In addition, tubes generate even-order harmonics, which are far more musical 
sounding in nature than the harsher odd-order distortions that are generated 
by its modern solid-state counterparts. 

The WWW 

Beyond the overall concept of digital media, another huge mover and shaker of 
everyday life is the Web. Cyberspace made the creation of this book much easier 
for me as a writer (search engines and company sites make research a relative 
breeze, and photos can be quickly e-mailed). Although many of the music share 
sites that contain ripped downloads of major releases have been shut down, 
legitimate pay-per-download sites have sprung into megabuck action. Personal 
and indy music sites have allowed upstart and established artists to directly sell 
their music, inform their fans about upeoming tours and publish fanzine info 
to keep their public begging for more. In fact, from a philosophical standpoint, 
I feel that the trend toward the breakdown of the traditional music industry 
isn't all bad news. I'm definitely not alone in thinking that this shift has taken 
the overall power out of the hands of a few ... and has given it back to the 
indies and to the individual "many." You might even say that the shift toward 
downloadable distribution has taken music production back into the home 
where it began several centuries ago with the popularity of having the family 
piano or other instruments in the parlor. 


omorroiu 


In short, although we're still at the dawning of the Web distribution age, cyber- 
space has already proven itself to be a source of increased visibility, viability 
and hope for the budding artist, as well as for the established corporate world 
of the music biz. 


TOMORROW 

Usually, I tend to have a decent handle on the forces that might or might not 
help shape the sounds and toys of tomorrow ... but it's getting increasingly 
harder to make specific predictions in this fast-paced world. Today, there are far 
more choices for gathering information and entertainment than reading a book 
or watching Lucy and Desi on the tube. Now, we can interact with others in a 
high-speed, networked environment (Figure 21.9) that lets us be more than just 
spectators; it lets us participate and share our thoughts with others, which leads 
to Creative discourse and personal growth. 

This idea of intercommunication through the WWW has already begun the drive 
of almost every developing technology toward an e-based commerce that's 
based on the distribution and sharing of goods, information and media-on- 
demand. With the advent of high-speed, broadband technology, one of the 
greatest concerns facing intellectual and media properties is the need for copy 
protection: the concept that even though it only takes 5 seconds to copy a file, 
the intellectual copyright of a song, video, book or program has a valué that 
must be compensated for. Unless the listening public grasps that concept, media 
providers are in for a harder ride than they're already having. One thing's for 
sure, copy protection issues (such as digital rights management or DRM) are 


FIGURE 21.9 

There are numerous 
sites that allows 
musicians to 
network, collaborate 
and remix other 
folks’ work on the 
Web. (Courtesy of 
indaba music, www. 
indabamusic.com.) 
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having a tough time being accepted, and it'll be interesting to see how these 
issues resolve over time. 

As I write this, digital audio has and will continué to become more portable, 
more virtual and more powerful. For example, I have a killer laptop production 
system that fits snuggly into my M-Audio backpack and a media-phone that'll 
practically cook eggs for breakfast. Although my main surround-sound studio 
is still centered around keyboards, synths and music controllers of various types, 
I've definitely welcomed the continued march toward quality virtual Instru¬ 
ments and useful plug-ins that seamlessly intégrate into the DAW. I think Fll 
always marvel at a Computer's ability to be a chameleon—one moment it's a 
music production system, next it's a video editor, then a word processor, then 
a graphic workstation, then a partridge in a pear tree. This aspect of technology 
literally frees us to be Creative in an amazing number of ways that's truly a thing 
of joy. 

One of my greatest hopes for the not-so-distant future is that the audio produc¬ 
tion community will embrace surround sound more fully. As my buddy George 
Massenburg says ... surround sound is to stereo as stereo is to mono. The leap 
from stereo to surround production in a studio environment can range from 
being simple and straightforward (with certain DAW Systems) to being a finan- 
cial and logistical challenge (especially for the uninitiated user); however, the 
knowledge and other rewards that are gained from learning surround produc¬ 
tion, mastering and media delivery are well worth it. I mentioned in an earlier 
chapter that as a kid I'd hold double álbum jackets behind my head to fill in 
the rear reflections. (Try placing reflective surfaces behind your two ears and 
angle them toward the stereo speakers. Can you hear the difference?) Similarly, 
those of you who have a home theater system might try playing your favorite 
stereo record or CD, while in the Dolby Pro Logic or SRS surround mode. The 
sound will either get sucked into the center speaker (in which case, it's best to 
switch back to stereo) or it'll widen out into the surround field in a way that 
just might cause your mouth to drop. 

HAPPY TRAILS 

Before we wrap up the 7th edition, I'd like to take a moment to honor one of 
the greatest forces driving humanity today (besides sex) ... the dissemination 
and digestión of information. Through the existence of quality books, trade 
magazines, university programs, workshops and the Web, a huge base of infor¬ 
mation on almost any imaginable subject is now being distributed to and 
understood by a greater number of aspiring artists and technicians than ever 
before. These resources often provide a strong foundation for those who are 
attending accredited schools as well as those who are attending the school of 
hard knocks. No matter what your goals are in life (or in the business of music), 
I urge you to jump in and read, surf, skip through pages and keep your eyes 
and ears open for new sounds, ideas, technologies and experiences. The knowl¬ 
edge and skills you gain will always be well worth the expended time and effort. 


Happy Trails 


On a final note, I'd like to paraphrase Max Ehrmann's "Desiderata," when he 
urges us to keep interested in our own career, however humble, as it's an impor- 
tant possession in the changing fortunes of time. Through my work as a pro- 
ducer, musician, writer and educator, I've been fortúnate enough to know many 
fascinating, talented and fun people. For some strange reason, I was born with 
a strong drive to have music and production technology in my life. By "keeping 
interested in my own career" and working my butt off, while having several 
brushes with extreme luck, I've been able to turn this passion into a successful 
career. 

To me, all of this comes from following your bliss (as some might cali it), lis- 
tening to reason (both your own and that of others you trust) and doing the 
best work that you can (whatever it might be). As you know, thousands of able 
and aspiring bodies are waiting in fine to make it as an engineer, a successful 
musician, a producer, etc. So how does that one person make it? By following 
the same directions that it takes to get to Carnegie Hall—practice! Or as the 
t-shirt says, "Just do it!" Through perseverance, a good attitude and sheer luck, 
you'll be led through paths and adventures that you never thought were 
possible. Remember, being in the right place at the right time, simply means 
being in the wrong place a thousand times ... "Showing up in life is huge!" 


Have fun! 



David Miles Huber 


www.5 lbpm.com 








APPENDIX A 


Continuing 

Education 


As the equipment used by engineers, producers and musicians places an ever- 
increasing emphasis on technology, education must play a greater role in the 
understanding of basic industry skills. Education can take many forms, ranging 
from a formal education to simply keeping abreast of industry directions from 
the many industry magazines. 

Beyond getting the necessary hands-on education by playing with all of the tools 
and toys, one of the best ways to get a better handle on the pulse of this indus¬ 
try is to read. Read all the magazines, books and articles that you can get your 
hot-little hands on. Of course, there's a ton of schools that offer courses in 
recording, media and music production. If this approach works for you, I'd 
recommend that you carefully check the institution and department out before 
potentially laying down your hard-earned cash to further your education towards 
getting into the music biz. 

THE WEB 

I obviously encourage you to be Creative about jumping onto a high-speed 
Internet connection. Go to your local library, bug a friend, hang out at your 
favorite wifi coffee shop with your laptop—whatever it takes to surf the world- 
wide info-stream. 

Most media-related magazines have a Web presence that lets you read and 
download the latest issue and even dig deeper by watching the latest technology 
videos, audio examples and the like. Users can also use the Web to communi- 
cate using public and prívate forums (allowing the users to voice their com- 
ments and ideas to other users and manufacturers) in addition to contacting 
the company directly using electronic mail Services. 

The most important lesson to be learned from this short appendix is to "Never 
Stop Learning"! Put a magazine rack in the bathroom, check out new tech- 
nologies, see who's doing what, surf the mediawaves and soak it all in ... and 
then put that knowledge into practice. The art of expanding your mind will 
soon transíate into an expanded set of skills that will help keep your skills and 
your outlook on life keen and sharp! 


Getting Organized! 


GETTING ORGANIZED! 

One of the less obvious, but most important tools for getting yourself out into 
the world, happens through the joining of industry organizations and going to 
industry conventions. On a personal note, most of my best friends and industry 
contacts have come to me through attending conventions such as NAMM and 
the AES ... while personal contacts and priceless opportunities have come to 
me through joining and actively participating in "The Recording Academy" 
(better known to most as the Grammy organization). 

LJsing the all-important adage "Showing up is huge!"... going to special indus¬ 
try events, conventions and music happenings will put you smack-dab in the 
middle of the action, putting all the people that are in the best position to help 
you out with information, contacts and even jobs—right there in the same room. 
Literally stepping up to the píate and helping with these organizations and 
events can help connect you to the industry in ways that are amazingly huge. 

For all of you students ... my first suggestion would be to immediately join 
Grammy U (www.grammy.com/grammyu or www.myspace.com/grammyu). 
For a very nominal price, this membership would allow students living near 
one of the many local chapters across the country to attend special Grammy 
events, lectures and parties. It's WEFF WORTH the time and effort! Here's what 
the site has to say about "The U." ... 

"The GRAMMY University NetWork (GRAMMY U) is a unique and fast-growing 
community of college students who are pursuing a career in the music industry. 
The Recording Academy® created GRAMMY U to help prepare students for 
careers in the recording industry through networking, educational programs 
and performance opportunities. GRAMMY U is designed to enhance students' 
current academic curriculum with access to recording industry professionals to 
give an "out of classroom" perspective on the recording industry. 

A small list of events and organizations ineludes: 

Conventions: 

■ AES (Audio Engineering Society)— www.aes.org 

■ APRS (Association of Professional Recording Services, UK)—www.aprs. 
co.uk 

■ Musikmesse—musik. messefrankfurt. com 

■ NAMM (National Association of Music Merchants)— www.namm.org 
Organizations: 

■ AES (Audio Engineering Society)— www.aes.org 

■ APRS (Association of Professional Recording Services, UK)—www.aprs. 
co.uk 

■ The Recording Academy (Grammy)— www.grammy.com 

■ SPARS (Society of Professional Audio Recording Services)—www.spars. 
com 

Happy reading, surfing and learning! 


APPENDIX B 


Tax Tips for 
Musicians 


by Jeffrey P. Fisher 


Author's Preface: Some might wonder why I included this excellent article on taxes 
within this book. Well, it's a well-known fací that artists (and the general public at 
large) aren't always money conscious ... and with the burden of student loans, equip- 
ment cost and the decisions that goes with day-to-day living, it's more important than 
ever to learn the valué of careful fmancial decisions that could effect your credit rating, 
your ability to save and your fmancial security. Understanding the valué of money, the 
power of a personal budget and the responsibilities that go with incurring a load of 
debt are just as important as anything you might learn in this book. 

When I started in the biz, I had very little money and was living on my own on the 
West coast. Since I've never been one for living on the edge, I quickly learned the 
necessity ofsaving up for times when the going might get rough. As a result, I've always 
been able to set aside enough to Uve comfortably, without the fear that goes with incur¬ 
ring too much debt. Over the years, this simple idea has given me freedoms that I 
never would've thought possible—all through the simple concept of saving for a rainy 
day. llnderneath all of the technicalities of Jeffrey's article is the moral ofsaving, living 
within your personal means and setting back enough bucks to enjoy the fruits of your 
artistic labors. —DMH 

Everybody complains about taxes. But few do anything about it. Well, there are 
several actions you can take to improve your tax situation right now. If you're 
making even the tiniest amount of music-related scratch, there's no reason to 
pay more taxes than you have to. To reap the most tax benefits, start running 
your music career as a legal small business. The IRS loves small business. Accord- 
ing to the Small Business Administration, there are 25 million small businesses 
in the U.S. today. And a large percentage of them are solé proprietorships—one 
person shops. As a solé proprietor, you report your music business income as 
part of your personal income using Schedule C (and a few other forms). 

A BUSINESS OF YOUR OWN 

It makes real financial sense to run your project studio as legal business. Follow 
these basic steps: 








"But, Dude, I Gotta Eat” 


■ Set up your business by choosing its legal structure (sole-proprietorship, 
partnership, Corporation, etc.)—Consult with a tax adviser for details 
about the financial aspects of each form. Contact a legal adviser for 
answers to liability issues. 

■ Get legal—Make sure you meet any regulations that pertain to running a 
business in your town. For example, you may have to get a business license 
from your local clerk's office. 

■ File a doing-business-as (dba) with your local government if you cali your 
business something other than your legal ñame—You may need a sepárate 
tax ID for your business and some States require a sales tax ID number. 

■ Open a business checking account—Deposit your project studio income 
into it and pay your business expenses using checks drawn on it. Also, use 
a credit card only for business purchases and pay it off on time from your 
business checking account. 

■ Use bookkeeping software to track all your business income and 
expenses—This makes tax preparation and monitoring your financial 
situation easier. Understand the various tax consequences of your busi¬ 
ness, too. You'll probably need to make quarterly tax payments in addition 
to yearly tax preparation. 

■ Protect yourself through health and property insurance—Also, consider 
life, disability, and liability insurance if it makes sense for your situation. 

It all comes down to income and expenses—the money you make and the 
money you spend. The more you make, the more you pay in taxes. Simple, right? 
Even the most convoluted of IRS instructions make that point painfully clear. 
That means the inverse is also true. Since the IRS only taxes your business profits, 
cut back on the profit and pay less taxes. 

“BUT, DUDE, I GOTTA EAT” 

Whoa there. Im not saying that you should earn less. Instead, look for all the 
possible ways to convert some of your everyday expenses into legitimate busi¬ 
ness deductions. Even some personal expenses may be deductible against the 
business. The more expenses you have, the more you reduce your taxable 
income. And since you were going to spend the money anyway, you might as 
well realize some tax benefits, too. 

WRITE-OFFS 

Basically, all the expenses you incur to run your little music business are deduct¬ 
ible. To be fully deductible, however, these business expenses must be "ordinary 
and necessary" according to the IRS. That's just fuzzy enough to be dangerous. 
Ordinary means the expenses must be typical for the business. Buying a new 
guitar could apply; buying a dishwasher wouldn't. Necessary just means the 
expense is vital to the success of your business. Office equipment, postage, 
phone charges, graphic design charges, recording studio fees, duplication, dues, 
magazine subscriptions, and other such related Ítems are definitely necessary 
for the success of the typical music business. 
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Here is a basic list of business deductions for musicians: 

■ Advertising and promotion costs 

■ Car and truck expenses 

■ Commissions and fees you pay to other people/businesses 

■ Depreciation and Section 179 deduction (see article) 

■ Insurance (except health insurance—see article) 

■ Interest (on business loans) 

■ Legal and professional fees (hire a lawyer; deduct the charge) 

■ Office expenses 

■ Rent/lease payments 

■ Repairs and maintenance 

■ Supplies 

■ Taxes and licenses such as a business license 

■ Travel costs 

■ Meáis and entertainment (see article) 

■ Utilities (see home-office deduction in article) 

■ Wages, salaries paid 

For every $100 you earn, you pay approximately $45.30 in taxes (if you're in 
the 27% tax bracket, pay the 15.3% self-employment tax, and send an addi- 
tional 3% to your State). Of course, that also implies that for every legitimate 
$100 business expense you incur, you also save the $45.30 you would otherwise 
pay in taxes. Hey, that's like getting everything you buy at a discount. 

Why does the IRS let you deduct all these expenses? They want you to succeed. 
So, they let you invest money (spend it) in your business as incentive for you 
to earn more. And the more money you make, the more you'll pay in taxes. You 
see they have an ulterior motive; they ain't jus' bein' neighborly. 

But there's a caveat (isn't there always?). You need to be gainfully engaged in 
making a buck. You must turn a profit in your business three out of every five 
years or your business will be classified as a hobby, and you forfeit the expense 
deductions. Bottom line: very bad news and very high tax bilí. (And one dollar 
in profit those three years ain't gonna make you popular down at the Treasury 
Department neither). 

Since the burden of proof falls solely to you, it's vital that you record all your 
music business income and expenses diligently. A shoebox full of receipts does 
not a bookkeeping system make. Get help setting up your books or look for a 
software solution to help document your business financial transactions. 

Another important gotcha: If you're just launching your music business, startup 
expenses can't be deducted all at once. You must amortize them over five years 
by taking 20% portions of the total expenses and deducting them over five 
consecutive years. 

GEAR LUST = TAX SAVINGS 

Did you know that the gear you buy for making your music magic could be a 
sweet tax deduction? Under Section 179 of the tax code you can deduct or 



Home Suieet Home 


"expense" up to $100,000 of tangible property and write it all off when you 
prepare your taxes. For tangible property think expensive, long-lasting Ítems, 
such as a new Computer. This amount can be above and beyond many other 
normal business expenses you might incur. 

If you've had a particularly strong earnings year, you might want to offset some 
of that gain by deducting all the cost of large purchases in one year (up to the 
limit). Alternately, you can choose to deprecíate what you buy and deduct a 
portion of those costs over the next several years. 

HOME SWEET HOME 

If you do the majority of your music work in your home office, you can deduct 
a portion of the same expenses that now do little or nothing to lessen your tax 
burden. You can write off rent or mortgage interest, property taxes, Utilities (gas, 
electricity, water/sewer), insurance, repairs, and depreciation. First, dedícate a 
portion of your home entirely for your music business. Keep it free of personal 
ítems and make it your primary business location. Beware that if you do most 
of your work elsewhere (gigging, for instance), and only use this home office 
occasionally, your deduction may be limited or entirely verboten. 

Here's how to figure your deductions. Total up the square footage of this exclu¬ 
sive and principal place and compare it to the total square footage of your crib. 
Say your math works out to 10% (100 square feet of a 1000 square foot home— 
you need a bigger place!). You can then deduct 10% of the aforementioned 
expenses using Form 8829: Expenses for Business Use of Your Home. The total 
deduction then flows through to your Schedule C, reducing your income, and 
therefore your taxes, considerably. 

There is a recapture clause (which doesn't apply to renters). If you sell your 
home and make a profit, those profit dollars become taxable business income 
at the same percentage rate as your deduction. Score a $50,000 gain (good for 
you!), and, following the above example, $5000 of it belongs to the business 
(subject to self-employment tax and regular income tax, of course). It's impor- 
tant to note that the personal income you make from a house sale is generally 
not taxed, though! Stop taking the home-office deduction for two tax years prior 
to the home sale and this recapture clause doesn't apply. 

SELF-EMPLOYMENT TAX 

Yes, we self-employed have a special tax just for us. Actually every worker pays 
the same tax—funding for Social Security and Medicare—it's just a little differ- 
ent when you're on your own. You must contribute both the employee and 
employer contributions, which total up to a whopping 15.3%. Yep, just over 
fifteen pennies on every buck you earn goes right into the Social Security kitty. 
This is, of course, befo re you start paying any regular income taxes. Ouch! And 
you have to pay the self-employment taxes (along with income taxes) quarterly. 
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You need to predict what you are going to earn this year, and the taxes that 
would be due on that dollar amount. Then, you send in 25% of that money on 
April 15, June 15, September 15, and January 15 of the next year. These esti- 
mated tax payments are important, because if you don't pay enough, there's a 
penalty due the next April 15. 


HEALTH INSURANCE 

You can deduct the premiums paid for health insurance, too. This doesn't come 
off the Schedule C, but is actually a front page deduction on your personal 1040. 
Self-employed individuáis can deduct 100% of the premiums they pay (unless 
Congress changes its collective mind). Other typical medical costs are deduct- 
ible on Schedule A (if you qualify). 


EAT, DRINK, AND BE MERRY 

When you entertain your clients, the money you spend is another write-off. 
However, these meáis and entertainment are subject to a 50% limitation. Spend 
$100 on a pizza party, and take $50 off on Schedule C. Give clients gifts, up to 
$25 per client, and you can take that as a full deduction, though. 

When you travel as part of your music business, those expenses are deductible 
including airfare, lodging, and meáis. You must support your travel and lodging 
deductions with receipts. However, instead of keeping track of your meáis, you 
can take the government's standard per diem allowance of $30 for Meáis and 
Incidentals. Other cities may have higher amounts so check the official Web site 
(www.policyworks.gov/perdiem). Meáis on the road are, of course, still subject 
to the 50% limit. The IRS figures you gotta eat anyway, so they limit the expense. 
Bummer. 

VEHICULAR DEDUCTIONS 

Yes, that oíd beater is worth money! Keep track of actual vehicle expenses (gas, 
repairs, etc.) or take the standard mileage rate (which changes every year; check 
with the IRS). In either case, you must document the miles you drive for busi¬ 
ness, date, and purpose of trips, along with expenses incurred. A dedicated 
notebook/diary earns a gold star from the IRS. 

Even if you use your ride for business and personal use, the business portion 
of your expenses is still deductible. Determine your business percentage by 
dividing your business miles by the total miles driven (2500 business/10,000 
total = 25%). If you just use the standard mileage rate, multiply your business 
miles driven by that rate (2500 x $/mile = deduction) to arrive at your deduct¬ 
ible expense amount. You can also deduct the full cost of tolls and parking fees 
incurred while on business. And the loan interest on the car is deductible 
(subject to the business use percentage, if it applies). 
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FEED THE NEST EGG AND SAVE, TOO 

You also save money by contributing to a qualified retirement plan. The IRS 
makes it easy to sock away some cash for a rainy day and rewards you with a 
nice, fat deduction each year. This is another 1040 deduction, not Schedule C. 
IRAs are the first method that pop up. 

However, they're limited to $2250 (which changes regularly). With a SEP (Sim- 
plified Employee Pensión) you can deduct as much as 15% of your business 
income topping out at $30,000 total per year. The more you put away, the more 
you save. And since you're really helping yourself down the road, it's a smart 
way to manage your taxes and your retirement. For some of us, a Roth IRA may 
be more prudent. Roths give you no up front deduction, but the earnings are 
tax-free. Talk to a financial planner. 


EOY TAXTIPS 

At the end of each year, you have another opportunity to reduce your tax burden: 
accelerate expenses and decelérate income. First, spend some cash on business 
expenses. Don't just blow the wad. Make sensible purchases this year that will 
reduce your taxable income. Ideal last-minute purchases inelude postage, equip- 
ment, general office supplies, and promotions. You can also pay your mortgage 
and health insurance premium before the year-end to realize some other tax 
savings on the personal side. Second, put off collecting money this December 
to January by billing your clients a little later. Though you'll have to pay taxes 
on the money eventually, you defer that payment for a whole year. 


GET HELP FROM THE IRS 

Surf on over to the always exciting IRS Web site (www.irs.gov) and download 
the free guides that explain the specific tax benefits for small business 
owners. 

■ #334, Tax guide for small business 

■ #463, Travel, entertainment, gift, and car expenses 

■ #533, Self-employment tax 

■ #535, Business expenses 

■ #583, Starting a business and keeping records 

■ #587, Business use of your home. 

FINAL WORD 

While we all have to pay taxes, we are only required to pay our fair share. Make 
sure you are not throwing money out the window. Take advantage of these and 
all the other tax breaks available to you. And put more music money in your 
pocket ... where it belongs! 
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AAC (Advanced Audio Coding), 

379 

AAF, see Advanced Authoring 
Format (AAF) 

Ableton Live "Simpler" software, 

226 

Ableton Software, 297-299 
Absorption 

commercial examples, 105 
control room design, 102-104 
high-frequency, 104 
low-frequency, 104-107 
Absorption coefficient 
equation, 102 
materials, 103 

Accelerator cards, DAWs, 278-279 
Accent miking, 140 
Accent pickup, definition, 140 
Accessories, DAW hardware, 244 
AC3 (Dolby Digital), 281, 373, 536, 
547, 550, 553-555 
ACID Pro, 297 
Acoustic guitar 

compression, 496 
frequency, 485 
Acoustic isolation 

and acoustic space, 73 
ceilings, 87 
floors, 84-86 
iso-booths, 90-91 
iso-rooms, 89-90 
noise, 91-92 
overview, 80-81 
partitions, 92 
risers, 86-87 
walls, 81-84 

Windows and doors, 87-89 
Acoustic partitions 

and acoustic space, 73 
function, 92 
leakage reduction, 138 
Acoustic piano, frequency, 485 
Acoustics 

control room design, 107-110 
definition, 73 
factors in, 79-80 
and headphones, 542 



room considerations, 524 
subwoofers, 532-533 
Acoustic shadow, from head, 68 
Acoustic sound chamber, 

characteristics, 109-110 
Acoustic trauma, definition, 63 
Active crossovers, characteristics, 528 
Active monitors 
design, 526, 528 
in surround, 551-552 
Adaptive filter, see Single-ended 
noise-reduction process 
ADAT lightpipe system, 218-219,467 
ADAT standard, 233 
A/D converter, see Analog-to-digital 
(A/D) converter 
Additive synthesis, 342 
Address, timecode, 394-395 
Adobe Audition 3, 521 
Adobe Audition WMA9 surround 
encoder, 557-558 
ADR, see Automatic dialog 
replacement (ADR) 

Advanced Audio Coding (AAC), 379 
Advanced Authoring Format (AAF), 
256 

AEA A440 ribbon microphone, 114 
AEA MS-38 Mark II, 144 
AES, see Audio Engineering Society 
(AES) 

AES/EBll protocol, 215-216 
AES31 standard, 256 
AFA A440 ribbon microphone, 114 
Aftertouch messages, MIDI specs, 
328 

AIFF format, 255, 375 
Air 

absorption coefficients, 103 
vibrating mass effects, 43 
wave movement, 42 
Air Lyndhurst, 7 
Air suspensión, speaker design, 
525-526 

Akai MPC1000 Music Production 
Center, 344 

AKG C460B/CK61 ULS, 124 
AKG C3000B condenser 

microphone, 116, 178 


AKG C3000B microphone, 176 
AKG DI 12 microphone, 172, 174 
AKG D321 microphone, 124 
Alesis DM5 microphone, 352 
Alesis iMultiMix 16, 228 
Alesis lightpipe system, 218 
Alesis SR-18 stereo drum machine, 
350 

Algorithms 
delay, 504 
DSP effects, 271 
dynamic range, 272 
loop-based audio, 295 
noise reduction, 517-518, 520 
pitch-shifting, 293 
Real Audio, 379 
signal processing, 470 
synths, 342 

Aliasing, digital sampling, 205-206 
Alignment, magnetic tape heads, 
196 

Ambient miking 

techniques, 141, 145 
tutorial, 146 

Ampex decks, function, 186 
Amplifiers (amp) 

amplification process, 415-417 
core component, 415 
digitally controlled, 421-422 
distribution amp, 419-420 
operational type, 418 
power amp, 420-421 
preamps, 130-132, 219, 418, 
489, 537 

summing amp, 419 
voltage-controlled, 421-422 
Amplitude 

wave cancellation, 50 
waveforms, 43-45 
Analog audio technology, vs. digital 
example, 200 
Analog noise reduction 

compansion process, 515-516 
overview, 513-514 
Analog sequence editing, recording 
process, 35-36 

Analog signáis, vs. digital sampling, 
201 
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Analog tape modulation 
limiters, 500 
tutorial, 514 

Analog tape recorder (ATR), see also 
Magnetic tape 
archive strategies, 196-197 
backups, 196-197 
benefits, 181-182 
bias current, 189-190 
cleanliness, 195-196 
degaussing, 196 
equalization, 189 
erase head, 189 
history, 181 
media, 182-183 
monitoring modes, 190-192 
playback curves, 190 
playback head, 188 
print-through, 194-195 
professional, 184 
punch-in, 192-193 
record head, 187-189 
tape, tape speed, heads, 

193-194 
tape storage, 195 
tape transport, 184-187 
track transfer, 182 
Analog-to-digital (A/D) converter 
jitter, 222 
oversampling, 206 
transducers, 39 
Anderton, Craig, 361 
Angle of incidence, and polar 
patterns, 122 

Anti-aliasing filters, 206-207 
Aphex Model 622 Logic-Assisted 
Expander/Gate, 501 
API 550A equalizer, 483 
Apple AIFF format, 255 
Apple Boot Camp, 239 
Apple GarageBand™, 296 
Apple iTunes®; 389 
Apple MacBook Pro, 243 
Apple Macintosh HFS, 372 
Apple Mac Pro™, 241 
Apple Quicktime, 378, 557 
APRS, see Association of 

Professional Recording 
Services, UK (APRS) 

Apserity noise, 514 
Archiving 
ATRs, 197 

DAW Systems, 249, 285-287 
recording project tips, 606-607 
Arrangers, as personnel, 19 
Artificial reverberation, and 
engineer, 71 
Art'n Audio Studio, 311 


ASIO, see Audio Stream Input/ 
Output (ASIO) 

A6 Andrómeda analog-modeled 
digital synthesizer, 343 
Asperity noise, 514 
Assistant engineer, as personnel, 
20-21 

Association of Professional 

Recording Services, UK 
(APRS), 624 

ATR, see Analog tape recorder (ATR) 

ATRAC, 229-230 

ATR-108C recorder, 185 

ATR-102 recorder, 185 

Attack 

compressor, 493-494 
definition, 57 
noise gate, 501-502 
Attenuation, filters, 480-481 
AU, see Audio Units (AU) 

Audience (seated), absorption 
coefficients, 103 

Audio 

bit rate and file sizes, 376 
DAW driver protocols, 251-252 
DAW hardware, 244 
DVD formats, 373 
Internet distribution, 10 
laptops, 244 

loop-based software, 295-299 
MIDI system connections, 314 
surround sound codees, 548 
tracks from MIDI, 364 
for video and film, 17 
Audio Engineering Society (AES), 
624 

Audio Interchange File (AIFF), 255, 
375 

Audio interface 
DAWs, 249-252 
and MIDI, 338 
surround sound, 552-553 
Audio production consolé (mixer), 
see Mixing systems 
Audio recorders, real-world sync, 
409-410 

Audio Stream Input/Output (ASIO), 
252 

AudioSuite, 277, 471 
Audio Units (AU), 277, 471 
Audition 3 (Adobe), 521 
Audition WMA9 surround encoder 
(Adobe), 557-558 
Auditory perception 
beats, 65-66 
overview, 64-65 
Automatic dialog replacement 
(ADR), audio-for-visual 


production environment, 
75-76 
Automation 

DAWs, 237, 512 
effeets, 512 

mixdown and effeets, 460-461 
mix and effeets, 279-281 
rubberbands, 463-464 
signal processing plug-ins, 471 
virtual mixer examples, 462 
Auxiliary controllers, electronic 
instruments, 341 
Auxiliary send 

mixing systems, 435-438 
parallel routing, 475 
AVANTONE MixCubes Full-Range 
Mini Reference Monitors, 542 
Average absorption coefficient, 
equation, 103 

Average signal level, dynamic range, 
487, 492 

Averaging meter, dynamic range, 
490-491 

B 

Back catalog material, surround 
sound, 561 
Backups 

ATRs, 196-197 
DAW systems, 249, 285-287 
investment protection, 610 
recording project tips, 606-607 
Sys-Ex messages, 336 
Baffle 

recording process, 28 
S5-2F "Sorber" baffle, 93 
Balanced fines, mic characteristics, 
128-129 

Balancing, and mixing, 463-467 
Bandpass filter, 482 
Bandwidth 

noise reduction, 520 
peaking filters, 479-480 
Basic tracks, recording process, 
425-426 

Bass boost, voice miking, 168 
Bass guitar 

compression, 496 
frequeney, 485 
peak metering, 488 
Bass management, monitoring, 533 
Bass reflex, speaker design, 525-526 
Bass trap 

definition, 104 
functional, 107 
pressure-zone trap, 105, 107 
quarter-wavelength bass trap, 
104-105 
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BD, see Blu-ray disc (BD) 

Beats 

characteristics, 65-66 
slicing, 293 
tutorial, 66 

Beatscape groove module, 304 
Beats-per-minute (bpm), MIDI 
session, 355 

Bed tracks, recording process, 
425-426 

Behringer Denoiser system Model 
SNR2000, 517 

Behringer UltraGain preamp, 219 
Behringer llltrapatch PX2000, 450 
Bell, Alexander Graham, 58 
Bell, Mary C., 613, 615 
Beyerdynamic MI20 ribbon 
microphone, 114 
Beyerdynamic MI60 ribbon 

microphone, 114, 172-173, 
175 

BFD acoustic drum library module, 
350 

BFD2 acoustic drum library module, 
306 

Bhringer UltraGain Pro-8, 219 
Bias current, ATRs, 189-190, 191 
BiCoastal Music, 75 
Bidirectional polar pattern, mic 
characteristics, 119 
Binary number system 
digital audio, 199 
quantization, 203 

Binaural localization, definition, 67 
Biphase modulation 
definition, 396 
SMPTE timecode word, 397 
Biscuit, vinyl disc manufacture, 595 
Bit rates 

audio, 376 

digital sound files, 213-214 
mastering process, 572 
sound files, 257 
Bits, quantization, 203 
Bleed, see Leakage 
Blue Book CD, 372 
Blumlein, Alan Dower, 142, 
613-614 

Blu-ray disc (BD), burning, 

589-590 

Board, see Mixing systems 
Bongo, frequency, 485 
Bookshelves, reflections, 102 
Boot Camp (Apple), 239 
Boot drive, DAW systems, 286-287 
Boudreau, Matt, 8 
Boundary microphone, 135 
bpm, see Beats-per-minute (bpm) 


Brain 

direction perception, 68-69 
doubling process, 71, 504 
reverb, 71 

space perception, 69-70 
Brass instruments 
compression, 496 
mic placement, 150-152 
Brick 

absorption coefficients, 103 
acoustic isolation, 84 
surface density, 82 
Bridgeport Music et al. v. Dimensión 
Films , 347 

Broadcast wave format, 255, 375 
Broken Radio Studio, 4, 8 
Brooks, Evan, 405 
Bryston 9B SST professional 

5-channel amplifier, 421 
Building materials, surface densities, 
82 

Burning speeds, CD-R/RW, 586 
Bytes, MIDI message, 323-324, 

327 

c 

C, see Capacitance (C) 

Cambridge equalizer, 273 
Cancellations 

microphones, 52 
room reflections, 98-99 
wave amplitude, 50 
Capacitance (C) 

and charge, 115-116 
and sound waves, 117 
Cardioid microphones 
AKG DI 12, 172 
guitar amp miking, 153 
Neumann KM 180 series, 175 
pickup pattern, 123 
surround Decca tree, 146 
Cardioid polar patterns, mic 
characteristics, 120-122 
Career development, considerations, 
24-26 

Carpet, absorption coefficients, 103 
Cars, monitoring in, 543 
Cascade Microphones Fat Head 
ribbon microphone, 114 
CBR, see Constant bit rate (CBR) 

CD Baby, 587 

CD-R, see Compact Disc-Recordable 
(CD-R) 

CD-RFS format, 372 
CD-ROM/XA format, 372 
CD-RW, see Compact Disc- 
ReWritable (CD-RW) 

CDs, see Compact Dises (CDs) 


CD-UDF format, 372 
Ceera, Peter, 616 

Ceilings, and acoustic isolation, 87 
Cello 

miking, 167 
radiation patterns, 56 
Central processing unit (CPU) 
DAW, 243, 282 
electronic instruments, 340 
mixing systems, 435, 437 
Chairs, absorption coefficients, 103 
Chamber effect, 508 
Channel assignment, mixing 
systems, 446-448 
Channel fader, mixing systems, 

443-444 

Channel input, mixing systems, 
432-435 

Channel Pressure messages, MIDI 
specs, 328 
Channels 

inline effect, 474 
MIDI, 324-326 
Channel select, tutorial, 459 
Channel Voice messages, MIDI 
specs, 327-330 
Charge (Q) 

electret-condenser mic, 118 
equation, 115-116 
Chirping, 520 
Chorusing, 504 
Cinema display (Apple), 241 
Cinemascope, 546 
Circle Surround format, 557 
Clarinet, miking, 169-170 
Cleanliness, ATRs, 195-196 
Click/pop eliminator application, 
521 

Clicks 

remo val, 522 
sound file editing, 265 
Click track, MIDI session, 356 
Client entertainment, tax tips, 629 
Clipping, amplification process, 
416-417 
Cióse miking 

techniques, 135-140 
tutorial, 140 

Coarse pitch, vinyl disc cutting, 

594 

Combination tones, definition, 66 
Combing, delays, 504 
Commercial control rooms, 
characteristics, 9 
Communication, DAW 
characteristics, 237 
Communication ports, electronic 
instruments, 341 
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Compact Disc-Recordable (CD-R) 
burning speeds, 586 
self-distribution, 587 
technical aspects, 584-586 
Compact Disc-ReWritable 
(CD-RW) 

burning speeds, 586 
technical aspects, 584-586 
Compact Dises (CDs) 

DAW backups, 286 
as delivery media, 371-372 
handling and care, 588 
instrument samples, 346-347 
labeling, 588 
manufacturing 
burning software, 579-584 
process, 578-579 
technical aspects, 577-578 
and MIDI, 338 
secure digital music, 390 
song sequence editing, 35 
Compansion process, 515-516 
Compatibility, sound file formats, 
256-257 

Complex waves, examples, 54 
Composite (comparison), sound 
files, 267 

Compressed codee sound file 
formats, 375 

Compressed passage, vs. dynamic, 
571 

Compression 

applications, 496-497 
compressor, 492-494 
definition, 42 
desktop video, 386 
example, 43 
limiting, 498-500 
mono mix, 496 
recording and mixdown, 495 
tutorial, 497 
Compressor 

Controls, 493-494 
purpose, 492-493 
Computers 

CD manufacturing, 578-579 
control room noise, 92 
DAWs, 236, 238-241, 282, 
284-285 

desktop DAW, 240-241 
DVD/Blu-ray burning, 589-590 
laptop DAW, 241-244 
and MIDI, 337-339 
MIDI daisy chain, 321 
in project studios, 78 
signal processing plug-ins, 471 
timecode routing, 411-413 
Concerts, practice, 609-610 


Concrete 

absorption coefficients, 103 
and acoustic isolation, 84 
surface density, 82 
Condenser microphone 
AKG C3000B, 176, 178 
characteristics, 121-122 
design, 115-117 
electret-condenser mic, 118 
grand piano, 156 
mic selection, 149 
MXLV67Í, 177-178 
Neumann KM 180 series, 175, 
177 

phantom power, 117-118 
Telefunken M216, 179 
Congas 

frequeney, 485 
miking, 166 

Constant bit rate (CBR), 377 
Continué command, MIDI real-time 
messages, 405 

Continuing education, web and 
organizations, 623-624 
Continuous mode, jam sync, 401 
Control Change messages, MIDI 
specs, 329-330 
Controller ID numbers 
listing, 332-334 
MIDI specs, 330-331 
Controllers 

DAW Systems, 252-255 
electronic instruments, 340-341 
MIDI drum, 349-352 
MIDI keyboard, 347-349 
tutorial, 363 
valué editing, 362-363 
Control panel, electronic 
instruments, 340 
Control room 

acoustic isolation 
ceilings, 87 
floors, 84-86 
iso-booths, 90-91 
iso-rooms, 89-90 
noise, 91-92 
overview, 80-81 
risers, 86-87 
walls, 81-84 

Windows and doors, 87-89 
acoustic partitions, 92 
acoustics overview, 79-80 
design 

absorption, 102-104 
frequeney balance, 97-102 
room reflections and reverb, 
107-110 

symmetry, 92-97 


equipment needs, 607-608 
example, 15 
function, 7-9 

Conventions, continuing education, 
624 

Copy & Paste techniques, sound file 
tutorial, 263 
Copyright 

recording project songs, 601 
and web, 390 
CoreAudio, 252 

Cost factors, and acoustic space, 
73-74 

CPU, see Central processing unit 
(CPU) 

Cream Studio, 2 
Crosby, Bing, 614 
Cross-fades 

mastering sequencing, 568 
sound files, 265 

Cross-Interleaved Reed-Solomon 
Code (CIRC), 586 
Crossover networks, speaker design, 
526-528 

Cubase/Nuendo, see Steinberg 
Cubase/Nuendo 

Cues 

mastering volumes, 568 
reverb, 506-507 
Cuniberti, John, 147 
Cutoff frequeney, 480 
Cutting head, vinyl disc 

manufacture, 591-593 
Cyberspace, 388 
Cycle 

definition, 45 
and phase, 48-49 
Cymbal crash 
frequeney, 485 
overhead miking, 163-164 
peak metering, 488 
waveform envelope, 57 

D 

Daisy chain, MIDI systems, 

319-321 

Dampeners 

drum set miking, 160 
snare drum miking, 163 
DAT, see Digital audio tape (DAT) 
system 

Data byte, 324 

Data errors, CD-R problems, 585 
Data management, DAW systems, 
286 

Data transmission, MIDI message, 
323 

dB, see Decibel (dB) 
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dbx system, 516 
DCA, see Digitally controlled 
amplifier (DCA) 

DDLs, see Digital delay devices 
(DDLs) 

"Dead" end 

room layout, 101 
room reflections, 99 
Decay, definition, 57 
Decay time, definition, 71 
Decca tree 

surround Decca tree, 146 
technique, 145 
Decibel (dB) 

auditory perception, 66 
basic concepts, 61 
definition, 58 
power, 60-61 

sound-pressure level, 59-60 
voltage, 60 

Degaussing, ATRs, 196 
Delay 

characteristics, 503-504 
DSP effects, 274, 276 
15 ms, 504 

15 to 35 ms, 504-505 
35+ ms, 504-505 
tutorial, 506 

Delay units, and engineer, 71 
Delivery formats 

compressed codee sound files, 
375 

digital audio, 374-375 
Dolby Digital (AC3), 553-555 
Dolby Pro Logic, 555-557 
surround sound, 553-559 
uncompressed sound files, 375 
Delivery media 
CD, 371-372 
DVD, 371, 373 
web, 373-374 
Denise Rich Songs, 11 
Depth control, vinyl disc-cutting 
lathe, 592 

Desk, see Mixing systems 
Desktop video, and multimedia, 
386-387 

Details, recording project tips, 
597-598 

Device driver, multimedia 

environment, 370-371 
DI, see Direct injection (DI) box 
Diaphragms 

condenser mic, 115 
dynamic mic, 112-113, 121 
ribbon mic, 113 
stereo mies, 143 
transient response, 125 


Diffraction 

obstacle effects, 48 
sound, 47 
Diffusers 

commercial examples, 100 
room reflections, 99 
Digidesign Control console/Pro 
Tools, 617 

Digidesign 003 control surface, 

254 

Digidesign DINR noise-reduction 
plug-in, 520 
Digidesign founder, 615 
Digidesign ICON, 254 
Digidesign/PMC RM1/2 studio 
monitors, 541 

Digidesign "Sound Tools," 617 
Digital audio editor, mastering 
process, 572-573 
Digital audio recording systems 
ADAT MDM format, 233 
DAT system, 230-231 
DAT tape/transport format, 232 
DTRS MDM format, 233 
flash memory handhelds, 
227-229 

hard-disk multitrack recorders, 
227 

hard-disk recording, 226-227 
MiniDisc, 229-230 
modular digital multitrack, 
232-233 
overview, 224 
portable studios, 227 
rotary head, 230 
samplers, 225-226 
tutorial, 225 

Digital audio tape (DAT) system 
SCMS, 217 
technology, 230-231 
transport format, 232 
Digital audio technology 
vs. analog example, 200 
audio levels, 214 
basics, 200-201 
as delivery format, 374-375 
dither, 208-209 
fixed vs. floating-point 
Processing, 209 
Nyquist theorem, 205-206 
oversampling, 206-207 
overview, 199-200 
quantization, 203-205 
sampling, 201-203 
signal-to-error ratio, 207-208 
time reference, 408 
transmission, 214-220 
video and picture sync, 411 


Digital audio transmission 
ADAT lightpipe, 218-219 
AES/EBU, 215-216 
MADI, 217-218 
overview, 214-215 
SCMS, 217 
S/PDIF, 216-217 
TDIF, 219-220 

Digital audio workstations (DAW) 
accessories, 244 

audio driver protocols, 251-252 
audio interface, 249-252 
backups and archives, 285-287 
characteristics, 235-238 
Computer noise, 284-285 
Computer power, 282 
controllers, 252-255 
control rooms, 8-9 
desktop Computer, 240-241 
dither, 209 

driver updates, 283-284 
dual monitors, 284 
effects automation, 460-461, 

512 

equalization tutorial, 485 
FireWire, 246-247 
5.1 speaker placement, 550-551 
5.1 surround sound monitoring, 
548 

groove and loop hardware, 
303-306 

hardware considerations, 
238-240 
inline effect, 474 
instrument plug-ins, 341 
laptops, 11-12, 241-244 
lateney, 252 

loop-based audio software, 
295-299 

looping basics, 293-294 
loops into, 305-306 
manual looping, 295 
mastering sequencing, 567 
mic types tutorial, 118 
and MIDI, 310-311, 322 
MIDI recording vs. live, 358-359 
mixdown, 34, 460-461 
mixing systems, 432 
monitoring, 428 
monitor level control, 537 
music printing programs, 
366-367 

networking, 247-249 
production features, 235 
production media, 283 
RAM, 282-283 
read mode, 463 
real-world sync, 410-411 
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ReWire, 302 
rubberbands, 463 
and sequencing, 352-353 
session documentation, 287- 
289, 603 
software 

accelerator cards, 278-279 
basic ftmction, 257-259 
comping, 267 
DSP effects, 271 
DSP plug-ins, 276-277 
MIDI sequencing and scoring, 
267-269 

mixdown and effects 
automation, 279-281 
mixdown exporting, 281 
mixing, 269-279 
ReWire, 277-278 
sound file copy & paste, 263 
sound file recording, 262 
sound files, 259-267 
sound fixing, 265-267 
video and picture sync, 269 
sound source isolation, 27-28 
surround interfacing, 552-553 
system interconnectivity, 

244-249 

timecode routing, 412-413 
USB, 245-246 
virtual mixer technology, 
459-460 

write mode, 461-463 
Digital consoles 

effects automation, 460-461 
mixdown level, 460-461 
read mode, 463 
rubberbands, 463 
technology, 457-459 
write mode, 461-463 
Digital delay devices (DDLs), 
503-505 

Digitally controlled amplifier 

(DCA), function, 421-422 
Digital noise reduction, 518-521 
Digital recording chain 
jitter, 221-222 
recording process, 209-211 
sample rates, 212-213 
signal distribution, 220-221 
sound file bit rates, 213-214 
worldclock, 222-224 
Digital reproduction chain, playback 
process, 211-212 

Digital Rights Management (DRM), 
390 

Digital signal processing (DSP), see 
also Signal processing 
advantages, 470 


effects, 271-276 
noise reduction, 518-521 
plug-ins, 276-277 
quantization, 204 
samplers, 345 

time and pitch change, 292 
Digital Tape Recording System 

(DTRS), MDM format, 233 
Digital technology, 619 
Digital Theater System (DTS), 373, 
554-555 

Digital-to-analog (D/A) converter 
jitter, 222 
oversampling, 206 
Digital video disks (DVDs) 
burning, 589-590 
DAW backups, 286 
as delivery media, 371, 373 
handling and care, 588 
instrument samples, 346-347 
DIN plug, MIDI cable, 315 
Direct injection (DI) box 
electric guitar miking, 154 
electronic keyboard instruments, 
157-158 

interfacing, 146-147 
and leakage, 137 
Direct insertion monitoring, 
442-443 

Directional microphones 
mic selection, 149 
response, 119-122 
Direction perception, 67-69 
Direct signal, reverb, 506 
Direct sound 
definition, 70 
example, 69 
DirectX, 277, 471 

Disc-at-once writing, characteristics, 
584 

Disc-cutting lathe, vinyl disc 
manufacture, 591 
Disc jockey (DI), as personnel, 21 
Discrete time sampling, digital 
signáis, 203 

DiscWelder Steel CD/DVD audio 
authoring tool, 583 
Distant miking, technique, 133-135 
Disto rtion 

limiting, 500 

waveform and time-base, 221 
Distribution amplifiers, function, 
419-420 

Dither 

digital signáis, 208-209 
mastering process, 572 
DI Y, see Do It Yourself (DIY) 
tutorials 


DJ, see Disc jockey (DJ) 

DJ software, 306-307 
DM-1 Delay Modulator, 276 
Do It Yourself (DIY) tutorials 
ambient miking, 141 
ambient surround mies, 146 
analog tape modulation, 514 
asperity noise, 514 
beats, 66 

CD manufacturing, 586-587 
channel select, 459 
cióse mic, 140 
compression, 497 
controller valué, 363 
DAW effects automation, 512 
delay, 506 

digital audio recording systems, 
225 

equalization, 485 
FFT noise reduction, 521 
harmonics, 53 
limiting, 500 
loop-based editor, 296 
manual looping, 295 
masking, 67 
mic types, 118 
MIDI session, 355 
phase, 50 

proximity effect, 125 
punching, 358 
reverb types, 508 
session, 612 
SMPTE timecode, 396 
sound file copy & paste, 263 
sound file recording, 262 
suggestions, 3 

Dolby Digital (AC3), 281, 373, 536, 
547, 550, 553-555 
Dolby Digital Cinema System, 554 
Dolby Film Sound Project, 547 
Dolby noise reduction, 547 
Dolby Pro Logic, 537 
Dolby SR noise-reduction system, 
516 

Dolby Surround, 547 
Domains 

magnetic tape, 183 
orientation, 184 
structural layers, 183 
Doors, and acoustic isolation, 

87-89 

Double bass, miking, 167 
Doubling 

definition, 71 
delays, 504 
Dowd, Tom, 616 

Drawn automation, virtual mixers, 
463 
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Drivers 

DAW software, 283-284 
MIDI data, 339 
Drives, see also Hard-disks 
boot, 286-287 
CD-R/RW quality, 585 
DRM (Digital Rights Management), 
390 

Drop-frame timecode, NTSC video, 
398 
Drums 

drum machine, 349 
hand drum, 166 
kick drum, 161-162, 485, 
496-497 

loop plug-ins, 305 
miking, 160-161 
miking prep, 158-160 
plug-ins, 305 
riser, 87 

snare drum, 57, 162-163, 485, 
496-497 

DSP, see Digital signal processing 
(DSP) 

DTS 96/24, 554 
DTS (Digital Theater System), 
554-555 
DTS-ES, 554 
DTS Interactive, 555 
DTS Neo:6, 554 
DTS Virtual, 555 
Durbin, Deanna, 546 
DVDs, see Digital video disks 
(DVDs) 

Dynamic EQ, 273 
Dynamic microphone 
AKG DI 12, 172, 174 
design, 112-113 
grand piano, 156 
kick drum, 161 
mic selection, 149 
polar patterns, 121 
transient response, 125 
Dynamic plug-in processors, 519 
Dynamic processing 

dynamic vs. compressed, 571 
mastering process, 569-571 
in mixing, 464 
multiband, 571-572 
Dynamic range 

compression, 492-497 
digital audio levels, 214 
DSP effects, 272-273 
expansión, 500-501 
limiting, 498-500 
metering 

averaging meter, 490-491 
overview, 487-491 


peak meter, 490-491 
VLI meter, 489-490 
multiband compression, 498 
noise gate, 501-502 
overview, 487 
processors, 491 
voice miking, 168 
Dynamics section, mixing systems, 
440 

E 

Ear 

auditory perception, 64-65 
direction perception, 67-69 
hearing care, 63 
loudness levels, 57-58 
masking, 66 
overview, 61-62 
space perception, 69-70 
thresholds, 62-63 
Early reflections 
definition, 70 
example, 69 
reverb, 506 
Earthquake, 532 
Echo chamber, characteristics, 
109-110 

Echo jack, MIDI cable, 319 
Editing process 

DAW sound files, 261-265 
effects, 512 
MIDI sequencers 
controller valúes, 362-363 
overview, 359-361 
techniques, 361-362 
E-dub, 391 
Effects devices 

audio-for-film, 17 
control rooms, 9 
project studios, 11 
Effects processing 

automation, 460-461 
automation/editing, 512 
equalization application, 
483-484, 486-487 
equalizer types, 481-482 
hardware, 476-478 
high-pass filters, 480-481 
low-pass filters, 480-481 
in mixing, 464 
overview, 475-476 
peaking filters, 478-480 
shelving filters, 480 
signal paths, 472-475 
Effects rack, 470 
Effects send bus, example, 446 
Ehrmann, Max, 622 
Electret-condenser mic, design, 118 


Electric bass guitar, miking, 154 
Electric guitar 

compression, 496 
frequency, 485 
miking, 153-154 
Electric organ, frequency, 485 
Electronic Arts, 4,18 
Electronic instruments 
components, 340-341 
drum machine, 349 
keyboard controller, 347-349 
keyboards, 342-346 
MIDI drum controllers, 349-352 
overview, 339-352 
percussion, 349-352 
plug-ins, 341-342 
samplers, 343-346 
sample sources, 346-347 
Electronic keyboard, miking, 
157-158 

Electronic music, and MIDI, 311 
Electrostatic principie, condenser 
mic, 115 

Elliot Sheiner (ELS) surround 
sound, 547 

ELS, see Elliot Sheiner (ELS) 
surround sound 

EMI TG12412 mastering outboard 
EQ plug-in, 569 

End of Exclusive (EOX) messages, 
331 
Engineers 

artificial reverb, 71 
chair, 607 
delay units, 71 
drum set miking, 159 
mixdown process, 33 
monitoring overview, 530-531 
as personnel, 20-21 
as producer, 599 
recording process, 28-30 
recording project tips, 601 
Entertainment, tax tips, 629 
Envelope, waveforms, 56-57 
EOX, see End of Exclusive (EOX) 
messages 

EQ, see Equalization (EQ) 

EQF-100 equalizer, 482 
Equalization (EQ) 

application, 483-484, 486-487 
ATRs, 189 
boost/cut, 479-480 
delay, 504 

DSP effects, 272-273 
effects processing, 477-478 
mastering process, 569 
in mixing, 438-440, 463 
pass filters, 480-481 
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peaking filters, 478-480 
room considerations, 524-525 
tutorial, 485 
Equalizers 

example, 438 
function, 418-419 
types, 481-482 
Equipment, general 
basic needs, 607-608 
hardware protection, 610-611 
tax tips, 627-628 
Equivalent noise rating, 126-127 
Erase head, function, 189 
Error correction 

CD manufacturing, 578 
digital recording, 211 
Erskine, Peter, 159 
Even harmonios, definition, 53 
Expander 

noise gate, 501-502 
operation, 500-501 
Expandibility, DAW characteristics, 
237 

Expansión, process, 500-501 

F 

Fabrik C, 274 
Faders 

channel fader, 443-444 
master output fader, 29, 33 
mixdown process, 33-34 
mixing and balancing, 465 
small-fader monitor modes, 440, 
442 

Fading, sound files, 264 
Fans, control room noise, 91-92 
Fantasía, 546 
Fantasound, 546 
Farfield monitoring, 538-539 
Fast Fourier Transform (FFT), 
518-522 

FET, see Field effect transistor (FET) 
FFT, see Fast Fourier Transform (FFT) 
Fiat aluminum disc, 594 
Field effect transistor (FET), 
117-118, 131 

Figure-8 pattern, mic characteristics, 
119 

File resolution, mastering process, 
572 

File sharing, DAW Systems, 247 
File size 

audio, 376 
streaming data, 388 
File tags, metadata, 382 
Film, see also Video 
audio for, 17 

timecode frame standards, 398 


Filters 

anti-aliasing, 206-207 
bandpass, 482 

high- and low-pass, 480-481 
notch, 483 
peaking, 478-480 
shelving, 480 

Finale 2009 music printing 
program, 366 
Final mix 

compression, 497 
definition, 429 

Fingered style, electric bass guitar 
miking, 154 

FireWire, 246-247, 253, 338 
First harmonic, example, 53 
5:1 distance rule, 137 
5.1 surround sound 
and EFE, 535-537 
mixdown exporting, 281 
monitoring 
overview, 548-549 
practical placement, 550-551 
speaker placement, 549-550 
up-mix, 559 

Fixed-point processing, digital 
signáis, 209 

FFAC (Free Fossless Audio Codee), 
380 

Flash, vinyl disc manufacture, 596 
Flashlight, 608 

Flash memory handhelds, 227-229 
Fiat frequeney response curve, 
definition, 48 
Fíats, see Partitions 
Fletcher-Munson curves 
compression, 495 
definition and example, 64-65 
monitor volume, 532 
Flexibility, DAW characteristics, 237 
Flipped mode, in-line monitoring, 
440, 442 

Floating floor, and acoustic 
isolation, 84-86 

Floating-point processing, digital 
signáis, 209 

Floor plans, recording studio 
examples, 5 

Floors, and acoustic isolation, 84-86 
Floor-tom, 164-165, 485 
Flute, miking, 170 
Flutter echo, 71, 99 
FM synthesis, 342, 382 
Foley, audio-for-visual production 
environment, 76 
Frame rate 

desktop video, 386 
timecode standards, 398-399 


Frames, timecode, 394 
Free Fossless Audio Codee (FFAC), 
380 

Freewheeling mode, jam sync, 401 
French horn, mic placement, 
151-152 
Frequeney 

and acoustic space, 73 
and equalization, 484 
French horn mic placement, 
151-152 

human hearing, 44 
interaural arrival-time 
differences, 68 
Nyquist theorem, 206 
stringed instrument miking, 
166-167 
trombone, 151 
trumpet mic placement, 

150-151 
tuba, 151 

waveform characteristics, 45 
wavelength relationship, 47 
woodwind instrument miking, 
169-170 

Frequeney balance, in room design, 
97-102 

Frequeney range, instruments, 485 
Frequeney response 

crossover systems, 527 
definition, 47-48 
distant miking, 134 
drum set miking, 161 
example, 49 

mic characteristics, 123-124 
mic selection, 149 
room considerations, 524 
Front-to-back discrimination, mic 
characteristics, 121 
Full messages, MIDI timecode, 

406 

Functional bass trap, function, 107 
Furman IT-1220 balanced-output 
power conditioner, 456 
Fuzziness, 514 

G 

Gain, compressor, 493-494 
Gain level optimation, metering, 
454-455 

Gain stage chart, example, 454 
GarageBand™ (Apple), 296 
Gate effect, 508 
General MIDI (GM) 
definition, 311-312 
and multimedia, 382-383 
nonpercussion instrument patch 
map, 384-385 
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percussion instrument patch 
map, 385 

Gilfoy Sound Studios, 616 
Glass 

absorption coefficients, 103 
surface density, 82 
GM, see General MIDI (GM) 
Gobos, see Partitions 
Gone with the Wind dialogue, 262 
"Good Rule," 111, 149, 439, 483 
Gotcher, Peter, 615 
Grace MI01 mic preamplifier, 131 
The Grammy llniversity NetWork 
(Grammy U), 624 
Grand piano 

miking, 155-156 
separation problems, 156 
Graphical user interfaces (GUIs) 
looping software, 297 
MIDI mixing, 365 
mixing systems, 432 
music printing programs, 367 
Graphic equalizer, 482-483 
Graphics imaging, methods, 383, 
386 

Green Book CD, 372 
Groove-based tools 
basics, 291-292 
beat slicing, 293 
DAW looping, 293-294 
loop-based audio software, 
294-299 
Reason, 299-302 
time and pitch change, 292 
warping, 292-293 
Groove hardware 
overview, 303 
plug-ins, 303-304 
Groove pitch, stereo recording, 
592-593 

Grounding, related issues, 
455-457 

Group, recording project tips, 601 
Grouping 

mixing systems, 448-449 
recording process, 426-427 
GUIs, see Graphical user interfaces 
(GUIs) 

Guitars 

amp miking, 153 
compression, 496 
frequency, 485 
miking, 152-154 
peak metering, 488 
recording direct, 147 
as transducer, 37 
Gypsum wallboard, 82-84, 103 
surface density, 82 


H 

Haddy, Arthur, 145 
Half-normaled configuradon, patch 
bays, 450 

Half-wrap tape path, 232 
Hall effect, 507 
Hammers, piano miking, 155 
Hand drums, miking, 166 
Handling & care, CDs and DVDs, 
588-589 
Hard-disks 

control room noise, 92 
DAW systems, 244, 286 
and MIDI, 338 
multitrack recorders, 227 
recording, 226-227 
sound file recording, 262 
Hardware, general 
advances, 614-615 
CD manufacturing, 578-579 
DAWs 

accessories, 244 
basic considerations, 238-240 
desktop Computer, 240-241 
laptop Computer, 241-244 
and device drivers, 371 
groove and loop, 303-306 
and MIDI, 312 
MIDI peripherals, 338 
mixing systems, 430-432, 437 
pan Controls, 445 
protection, 610-611 
virtual effects, 476-478 
Harmónica, miking, 170-171 
Harmonics 

combination tones, 66 
content, 52-56 
and equalization, 484 
and psychoacoustics, 508 
tutorial, 53 
voice miking, 168 
HDCD format, 372 
Head 

acoustic shadow, 68 
ATRs, 193-194 
Headphones 

recording process, 28-30, 428 
as speaker type, 541-542 
in studio, 543-544 
Headroom, 489 
Health insurance, tax tips, 629 
Hearing, see also Ear 
care of, 63 
threshold, 62 

Helical sean path, 230-231 
Hermán Miller Aeron chair, 607 
High-frequency absorption, control 
room design, 104 


High-pass filters, effects processing, 
480-481 

Hi-hat 

frequency, 485 
miking, 165-166 
The Hit Factory, 2 
Home office, tax tips, 628 
Home studio, example, 15 
Horns, frequency, 485 
Houston, Whitney, 15 
HoverDeck™ 88 isolation riser, 86 
HRM-16 headphone mixing station, 
544 

Human hearing, amplitude and 
frequency, 44 

Humanization process, MIDI 
editing, 362 

Hypercardioid microphones, 

Neumann KM 180 series, 175 
Hypercardioid polar patterns, mic 
characteristics, 120-122 

i 

IC, see Integrated circuit (IC) 

ICON Integrated consolé, 424 
Imaging, center and acoustic, 94-95 
Impedance, microphones, 127-128 
Industry considerations 

career development, 24-26 
professional fields, 23-24 
women and minorities, 23 
In jack, MIDI, 318 
Indine monitoring, mixing systems, 
440-442 
Inline routing 

on DAW track, 474 
external control, 473-475 
overview, 472-473 
Inner ear, diagram, 62 
In-phase 

definition, 49-50 
example, 50 
speaker polarity, 529 
Input attenuation pads, 433 
Input gain, compressor, 493 
Input mode, ATRs, 190 
Input strip 

Mixer 14:2, 435 
V-pot, 458 
Insertion loss, 478 
Insert point, mixing systems, 
433-435 
Instruments 

DI box interfacing, 146-147 
distant miking, 133 
electronic, 339-352 
equalization, 484 
frequency ranges, 485 
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GM nonpercussion instrument 
patch map, 384-385 
grouping, 427 
harmonics, 52 
leakage, 136-137 
MIDI channels, 326 
miking, 112 
brass, 150-152 
guitar, 152-154 
keyboard, 155-158 
percussion, 158-166 
stringed, 166-167 
woodwinds, 169-170 
mixing and balancing, 465 
overdubbing, 31 
recording process, 28 
timbre, 56 
track recording, 426 
violin vs. viola harmonics, 54-55 
Insurance, 610 

Integrated Circuit (IC), 131, 614, 619 
Integration, DAWs, 236-238 
Interaural arrival-time differences, 
definition and example, 

67-68 

Interchange, sound file formats, 
256-257 
Interconnectivity 

DAW Systems, 244-249 
MIDI systems, 314-316 
International Telecommunications 
Union (ITU), 5.1 speaker 
placement, 549 
Internet, see World Wide Web 
(WWW) 

Internet radio, 391 

ION Digital Media Workstation, 

242 

I/O ports, MIDI interace, 338 
IRS, see Tax tips 
ISO-0660 format, 372 
Isolation, sound sources, 27 
Isolation booths (iso-booths), 

90-91 

Isolation cabinets, guitar amp 
miking, 153 

Isolation rooms (iso-rooms), and 
acoustics, 89-90 
ITU, see International 

Telecommunications Union 
(ITU) 

iTunes® (Apple), 389 

iTunes® media player (Apple), 389 

j 

Jacks 

MIDI, 318-319 
MIDI Echo, 319 


Jam sync, modes, 401 
Jitter, digital recording chain, 
221-222 

Jogging, analog sequence editing, 36 
Joliet format, 372 

K 

Katz, Bob, 221 
Keyboards 

as drum controllers, 351 
and MIDI, 338 
MIDI keyboard controller, 
347-349 
miking, 155-158 
samplers, 343-346 
synth, 342-343 
workstation sequencers, 353 
Key input, noise gate, 502 
Keys, Alicia, 76 
Key side-chain input, 474 
Kickboard, piano miking, 157 
Kick drum 

compression, 496-497 
frequency, 485 
miking, 161-162 
Knowledge, in studio assembly, 
12-13 

L 

Label printer, 588, 608 

Lacquers, 594-595 

LAN, see Local area network (LAN) 

Land, disc grooves, 593 

Laptops 

DAW hardware, 241-244 
as portable studios, 11-12 
production portability, 621 
Large-scale integrated (LSI) circuit, 1 
Latency, DAW systems, 252 
Lawyers, 22-23 
Lazar, Emily, 564 
LE, 570 

Lead, surface density, 82 
Lead screw, 592 
Leakage 

cióse miking, 136-137 
reduction, 138 
LED flashlight, 608 
LENRD™ bass traps, 107 
Leslie cabinet, miking, 158 
Lexicón 960L multi-effects system, 
511 

Lightpipe system, 218-219 
Limiters 

íunction, 498-500 
tutorial, 500 

Linear curves, example, 58 
Line trim, 432 


Live (Abelton), 297-299 
Live effect, 508 
Live/ondocation recordings, 
creado n, 14-16 
Live room 
layout, 101 
leakage, 137 
reflections, 99 

Local area network (LAN), 247-249, 
387 

The Lodge, 564 
Logarithm (log) 
basics, 58-59 
definition, 58 
example, 58 
Logic DAW, 237 
Longitudinal timecode (LTC) 
method, 399 

SMPTE signal distribution, 403 
Loop files, sources, 307-308 
Loop hardware 

drum and drum loop plug-ins, 
305 

loop into DAW, 305-306 
overview, 303 
plug-ins, 303-304 
Loop software 

audio software, 295-299 
beat slicing, 293 
and DAW, 293-294 
DAW tutorial, 295 
tempo, 292 
tutorial, 296 

Lossy, digital sample rate, 212 
Loudness levels 
decibel, 59-61 
logarithm basics, 58-59 
overview, 57-58 

Low-frequency absorption, control 
room design, 104-107 
Low-frequency effect (LLE) 

Dolby Digital, 553 

5.1 speaker placement, 550 

monitoring, 532 

surround active monitors, 552 

surround with, 536-537 

surround without, 535 

theater with, 537 

Low-pass filters, effects processing, 
480-481 

LSI (large-scale integrated) circuit, 1 
LTC, see Longitudinal timecode 
(LTC) 

Lurssen Mastering, 564 

M 

MacBook Pro (Apple), 243 
Machine room, 8 
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Macintosh HFS format, 372 
Mackie Control Universal DAW 
controller, 253 

Mackie 8-bus consolé, 425, 430, 
431, 441, 447, 449 
Mackie HR626 monitor speakers, 
551 

Mackie HR824mk2 active monitor 
speaker, 525, 540 
Mackie Onyx 1200F, 250 
Mackie Universal Control, 461 
Mac Pro™ (Apple), 241 
MADI protocol, 217-218 
Magazines, continuing education, 
623 

Magnetic tape, see also Analog tape 
recorder (ATR) 

dynamic range metering, 489 
erase head, 189 
as media, 182-183 
playback head, 188 
tape head, 187-189 
time-length relationship, 186 
transport, 184-187 
Magneto-optical (MO) technology, 
229 

Magnetophones, example, 182 
Magnets, ribbon mies, 114 
Maintenance engineer, as personnel, 
21 

Manley Massive Passive Stereo 
Equalizer, 477 
Manley Stereo Variable Mu 
mastering outboard 
compressor, 570 

Manual looping, DAW tutorial, 295 
Manufacturing 

CD/DVD handling, 588-589 
CD-R/RW media, 584-586 
CDs, 577-584 
DIY CDs, 586-587 
facility choice, 575-577 
recording process, 36 
vinyl disc, 590-596 
for Web, 596 
Marimba, miking, 166 
Marker tracks, DAW session 

documentation, 287-288 
Marketing, recording process, 

36-37 

Mark of the Unicom Digital 
Performer, 258 
Martin, George, 7 
MAS (MOTU Audio System), 252, 
277, 471 
Masking 

definition, 66 
tutorial, 67 


Massenburg, George, 545, 621 
Mass in C Minor (Mozart), 151 
Master disc 

CD manufacturing, 579 
CD-R problems, 585 
Mastering engineer 
as personnel, 21 
process decisions, 566 
recording process, 34-35 
Mastering process 

digital audio editor, 572-573 
dither, 572 
dynamics, 569-571 
equalization, 569 
file resolution, 572 
multiband dynamic processing, 
571-572 

overview, 563-565 
preparation, 566-567 
project details, 567 
recording process, 34-35 
relative volumes, 568-569 
requirements, 565-566 
sequencing, 567-568 
surround sound, 553-559 
vinyl disc manufacture, 595 
Master mix, definition, 429 
Master output fader, 29, 33 
Master/slave relationship, real-world 
sync, 409 

MasterX5 48k multiband dynamics, 
275 

Material specifications 

absorption coefficients, 103 
surface densities, 82 
Matrix MS processor, 144 
Matrix vinyl dises, 595 
M-Audio Backpack, 244, 621 
M-Audio controller, 330 
M-Audio Fast Track Ultra, 250 
M-Audio MidAir 25, 317 
M-Audio MIDISPORT, 339 
M-Audio Octane 8-channel preamp, 
219 

M-Audio Torq DJ Performance 
production software, 307 
M-Audio Trigger Finger, 351 
McDSP ML4000H Mastering 
Plug-ins, 570 
MD, see MiniDisc (MD) 

MDM, see Modular digital 
multitrack (MDM) 

Media device synchronization 
digital audio timing reference, 
408 

LTC and VITC timecode, 

399-400 

overview, 393-394 


SMPTE timecode, 396, 401-403 
sync information data, 396 
timecode frame standards, 
398-399 

timecode overview, 394-395 
timecode refreshment/jam sync, 
400-401 

timecode word, 395-396 
video timing reference, 407 
Meetings, recording project 
preparation, 601-603 
Memory, 340, see also RAM 

(random access memory) 
Mercury Records, 4 
Messages 

channel voice, 327-330 
MIDI editing, 360 
MIDI real-time, 404-406 
MIDI specs, 323-324 
Sys-Ex, 322, 335-337 
system, 331, 335 
Metadata, tagged, 380-381 
Metering 

compressor display, 494 
dynamic range 
averaging meter, 490-491 
overview, 487-489 
peak meter, 490-491 
VU meter, 489-490 
mixing systems, 451-455 
Meyer, Chris, 405 
Mic/line channels, recording 
process, 28 

Microgroove pitch, vinyl disc 
cutting, 594 
Microphone design 
characteristics, 112 
condenser mic, 115-119 
dynamic mic, 112-113 
ribbon mic, 113-115 
Microphone placement 
accent mic, 140 
acoustic guitar, 152-153 
ambient mies, 141 
cióse mic, 135-140 
distant miking, 133-135 
drum set, 158-161 
electric bass guitar, 154 
electric guitar, 153-154 
electronic keyboard, 157-158 
floor-toms, 164-165 
French horn, 151-152 
grand piano, 155-156 
hi-hat, 165-166 
kick drum, 161-162 
overheads, 163-164 
overview, 111-112, 148-150 
rack-toms, 164-165 
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snare drum, 162-163 
stringed instruments, 166-167 
trombone, 151 
trumpet, 150-151 
tuba, 151 

tuned percussion instruments, 
166 

upright piano, 156-157 
voice, 168-169 
woodwind instruments, 

169-171 

Microphones, general 
balanced Une, 128-129 
cancellations, 52 
characteristics, 119-130 
definition, 111 

directional response, 119-122 
equivalent noise rating, 126-127 
frequency response, 123-124 
impedance, 127-128 
omnidirectional, 119 
output characteristics, 125-130 
overload characteristics, 127 
preamps, 130-132 
recording process, 28 
sensitivity rating, 125-126 
as transducers, 37-38 
transient response, 125 
trims, 432 
tutorial, 118 
unbalanced lines, 130 

Microphone selection 
AKG C3000B, 176, 178 
AKG DI 12, 172, 174 
Beyerdynamic MI60, 172-173 
guidelines, 149 
MXLV67Í, 177-178 
Neumann KM 180 series, 175, 
177 

overview, 111-112 
Royer labs R-121, 174, 176 
Shure SM57, 171-172, 173 
Telefunken M216 stereo, 179 

Microphone techniques 
accent mic, 140 
ambient mies, 141 
ambient surrond mies, 145 
cióse mies, 135-140 
overview, 132 

pickup characteristics, 132-133 
reamping in mix, 147-148 
recording direct, 146-147 
stereo miking, 142-145 
surround Decca tree, 146 
surround miking techniques, 
145-148 

3:1 distance rule, 137 

Middle ear, diagram, 62 


MIDI cable 

Echo jack, 319 
íunction, 315-316 
MIDI message, 323 
and phantom power, 316 
MIDI channels, 324-326 
MIDI interface 

definition, 338-339 
sync with SMPTE timecode, 
401-402 

MIDI Manufacturers Association 
(MMA), 331 

MIDI messages, 323-324, 360, 406 
MIDI modes, 326-327 
MIDI (musical instrument digital 
interface) 

to audio tracks, 364 
backups and archives, 606-607 
and computers, 337-339 
controller tutorial, 363 
effeets automation, 512 
electronic instruments 
components, 340-341 
keyboard controller, 347-349 
keyboards, 342-346 
overview, 339-340 
percussion, 349-352 
plug-ins, 341-342 
samplers, 343-346 
sample sources, 346-347 
file saving, 359 

general MIDI, 311-312, 382-383 
jacks, 318-319 
limitations, 313-314 
looping software, 299 
and multimedia, 381-382 
music printing programs, 
365-367 
overview, 309 

peripheral connections, 338 
phantom power, 316-317 
playback, 363-364 
production environments, 
310-312 

purpose, 312-313 
sequence mixing, 365 
sequencing, 352-354 
sequencing and scoring, 267-269 
specs, 322-337 
standard MIDI, 382 
synchronization, 404-405 
system interconnections, 

314-316 

typical system, 313 
wireless, 317-318 
MIDI sequencers 
characteristics, 312 
daisy chain, 321 


editing 

controller valúes, 362-363 
overview, 359-361 
techniques, 361-362 
integrated workstation, 353 
messages, 329 
music printing programs, 
366-367 

recording, 354-358 
software-based, 353-354 
MIDI session 
click track, 356 
múltiple tracks, 357 
overview, 354-355 
recorded vs. live, 358 
step time entry, 358 
tempo, 355-356 
MIDI specs 

Channel Voice messages, 
327-330 

Control Change messages, 

329- 330 

Controller ID numbers, 335-337 
Controller ID parameters, 

330- 331 

Sys-Ex messages, 335-337 
System messages, 331, 335 
MIDI systems 

daisy chain, 319-321 
multiport network, 321-322 
typical configurations, 319-322 
MIDI timecode (MTC) 

control structure, 405-406 
definition, 331 
overview, 405 
real-world sync 
audio recorders, 409-410 
Computer routing, 411-413 
DAWs, 410-411 
master/slave relationship, 409 
overview, 408 
software, 410 
VCRs, 410 

video and picture sync, 411 
SMPTE timecode conversión, 406 
usage guidelines, 413 
Mid-side (M/S) miking, techniques, 
143-145 

Miking, see Microphone placement 
Millenia HV-3D mic preamp, 131 
MiniDisc (MD), 229-230 
Minorities, in industry, 23 
Mission statement, recording 
project, 600 
Mixdown 

automation, 279-281, 460-461 
compression, 495 
exporting to file, 281 
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recording process, 32-34, 429 
recording project tips, 605-606 
Mixing 

and balancing, 463-467 
compression, 496 
control example, 476 
DAW software, 269-279 
DSP effects, 271-276 
MIDI sequence, 365 
in surround, 559-561 
Mixing systems 

auxiliary send, 435-438 
channel assignment, 446-448 
channel fader, 443-444 
channel input, 432-435 
component overview, 429-432 
consolé prototype, 614 
dynamics section, 440 
equalization, 438-440 
fimction, 7 
grouping, 448-449 
metering, 451-455 
mixdown mode, 33 
monitor section, 440-443 
output section, 444, 446 
purpose, 423 

virtual technology, 457-463 
MMA, MIDI Manufacturers 
Association (MMA) 

MO, see Magneto-optical (MO) 
technology 

Modular digital multitrack (MDM), 
230, 232-233, 403 
Modulation noise, 514 
Monitoring, see also Speakers 
bass management, 533 
in cars, 543 

configurations, 533-534 
for DAW, 284 
definition, 523 
farfield, 538-539 
5.1 surround sound 
overview, 548-549 
practical placement, 550-551 
speaker placement, 549-550 
headphones, 541-542 
level control, 537 
low frequency effect, 532 
mixing systems, 440-443 
modes, ATRs, 190-192 
monitor level section, 449 
mono, 534 
nearfield, 539-540 
overview, 530-531 
patch bay, 449-451 
professional, 524 
quad, 535 

recording process, 428 


small speakers, 541 
speaker design, 525-528 
speaker and room, 523-525 
spectral reference, 538 
stereo, 534 

stereo + substitution, 535 
in studio, 543-544 
subwoofer, 532-533 
surround without LFE, 535 
theater with LFE, 537 
volume, 531-532 
Monoaural compatibility, M/S 
techniques, 144 
Monoaural monitoring, 534 
Monochrome ll.S. video standard, 
398 

Mono mix, compression, 496 
Mono Music, 424 
MOTU Audio System (MAS), 252, 
277, 471 

MOTU 828mk3, 250, 537 
Mozart, Wolfgang Amadeus, 151 
MPEG-1, 372-373 
MPEG-2, 371-373 
MPEG-3 (MP3) 

backups, 285, 287 
coding schemes, 377-378 
DRM, 390-391 
MiniDisc, 229-230 
players, 245, 338, 495 
SRS, 557 

MPEG-4 (MP4), 378, 557, 559 
M-Powered, 570 
M/S, see Mid-side (M/S) miking 
MTC, see MIDI timecode (MTC) 
Mullin, John (Jack) T., 182, 
613-615 

Multiband compression, 498 
Multiband dynamic processing, 
571-572 
Multimedia 

CD delivery, 371-372 
copyright protection, 390 
definition, 369 
delivery formats, 374-375 
desktop video, 386-387 
DVD delivery, 371, 373 
as environment, 369-371 
general MIDI, 382-383 
graphics imaging, 383, 386 
Internet radio, 391 
and Internet speed, 387-388 
MIDI, 381-382 
perceptual coding 
AAC, 379 
FLAC, 380 
MP3, 377-378 
MP4, 378 


overview, 375, 377 
Real Audio, 379-380 
WMA, 378-379 
secure digital music, 390-391 
standard MIDI files, 382 
tagged metadata, 380-381 
virtual e-dub, 391 
web distribution, 388-390 
Multimedia audio, field growth, 
17-18 

Multiple-effects devices, 511 
Multiport network, MIDI systems, 
321-322 

Multisession dises, CD-R problems, 
585 

Multisession writing, characteristics, 
584 

Multitasking, multimedia 
environment, 370 
Multitimbral instruments, MIDI 
channels, 326 

Multitrack recorder, sound source 
isolation, 27 
Museums, 618 

Musical octave, definition, 52-53 
Musicians 

networking, 620 
professionalism, 608-609 
recording session tips, 604-605 
Music law 

importance, 22-23 
recording project tips, 599 
Music notation program, 366 
Music printing programs, 365-367 
Music studio, professional role, 
9-10 

Musikmesse, 624 

Mute fimction, channel fader, 444 
MXL V67i condenser microphone, 
177-178 

Mylar diaphragm, dynamic mic, 
112-113 

N 

NAMM, see National Association of 
Music Merchant (NAMM) 
National Association of Music 
Merchant (NAMM), 624 
Nearfield monitoring, 539-540 
Neato CD Labeler kit, 588 
Networking 

career development, 25 
musicians, 620 
Networks 

crossover, 526-527 
DAW system, 247-249 
MIDI system connections, 313 
Neumann KM 180 series, 175, 177 
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Neve 88RS, 431, 447 
Noise 

control room, 91-92 
DAW computers, 284-285 
dynamic ranges, 492 
Noise gate 

function, 501-502 
key side-chain input, 474 
noise reduction, 517-519 
Noise reduction 
analog, 513-516 
digital noise reduction, 518-521 
FFT-based, tutorial, 521 
overview, 513 
and restoration, 522 
single-ended noise-reduction 
process, 516-518 
Non-drop-frame code, timecode 
frame standards, 398 
Non-real-time DSP, 271 
Normaled configuration, patch bays, 
450-451 

Normalization, definition, 264 
Notch filter, 483 
Note-off messages, 327-328 
Note-on messages, 327, 356 
Novation Nocturn plug-in 
controller, 473 
Novation 25SL USB MIDI 
Controller, 348 
NTSC video, 398 
Nuendo, see Steinberg Cubase/ 
Nuendo 

Nyquist theorem, 205-206 

o 

Obstacles, effects on sound, 48 
OD, see Overdubbing (OD) 

Odd harmonics, definition, 53 
Offset times, SMPTE timecode, 
402-403 

Ohms, definition, 127 
Olson, Harry F., 107 
OMFI, see Open Media Framework 
Interchange (OMFI) 
Omnidirectional microphone 
definition, 119 
mic selection, 149 
Neumann KM 180 series, 175 
Omni Off/Mono, MIDI modes, 327 
Omni Off/Poly, MIDI modes, 327 
Omni On/Mono, MIDI modes, 
326-327 

Omni On/Poly, MIDI modes, 326 
One Hundred Men and a Girl, 546 
op-amp, see Operational amplifier 
(op-amp) 

Open configuration, patch bays, 450 


Open Media Framework Interchange 
(OMFI), 256 
OpenTL, 256 

Operating systems (OS), signal 
Processing plug-ins, 471 
Operational amplifier (op-amp), 
function, 418 
Orange Book CD, 372 
Organizations, continuing 
education, 624 

Oriented strand board (OSB), 
floating floors, 85 

OSB, see Oriented strand board (OSB) 
Otari MX-5050 B3 two-channel 
recorder, 184 
Outer ear, diagram, 62 
Out jack, MIDI, 318 
Out-of-phase 
definition, 49 
speaker polarity, 529 
Output gain, compressor, 493 
Output impedances, mies, 127-128 
Output section, mixing systems, 

444, 446 
Oven Studios, 76 
Overdubbing (OD) 

ATRs, 191 

recording process, 31-32, 
428-429 

Overhead microphones, drum sets, 
163-164 

Overload characteristics, 
microphones, 127 
Oversampling, digital conversión, 
206-207 

Overtones, definition, 52 

P 

Paisley Park's Studio A, 5 
PAL video, timecode frame 
standards, 398 
Pankow, Jimmy, 616 
Panning 

definition, 69 

hardware/software Controls, 445 
Pan pot, 69, 253, 443, 445, 448 
Parallel configuration, patch bays, 
451 

Parallel data transmission, MIDI 
message, 323 

Parallel routing, characteristics, 475 
Parametric equalizer, 482 
Partials, definition, 52 
Particleboard, surface density, 82 
Partitions 
acoustic, 92 

homemade example, 94 
S5-2L "Sorber" baffle, 93 


Passband, 480 
Passive monitors 
design, 526-528 
in surround, 551-552 
tuning, 525 
wire gauges, 530 
Patch bay, monitoring systems, 
449-451 

Patch data, Sys-Ex messages, 336 
Patch map, GM instruments, 
384-385 

PatchWerk Recording, 3, 470 
PCM, see Pulse-code modulation 
(PCM) 

Peak amplitude valué, definition, 43 
Peak-indicating meter, dynamic 
range, 488 

Peaking filters, effects processing, 
478-480 

Peak levels, dynamic ranges, 492 
Peak meter, dynamic range, 

490-491 

Peak program meters (PPMs), 
mixing systems, 451-452 
Peak-to-peak valué 
definition, 43 

dynamic range metering, 488 
Perception 

auditory perception, 64-66 
direction, 67-69 
space, 69-70 
Perceptual coding 
AAC, 379 
definition, 375 
FLAC, 380 
MP3, 377-378 
MP4, 378 
overview, 375, 377 
Real Audio, 379-380 
WMA, 378-379 
Percussion instruments 
drum machine, 349 
floor- and rack-tom frequeney, 
485 

GM, patch map, 385 

MIDI drum controllers, 349-352 

miking 

drum set, 158-161 
floor-tom, 164-165 
hi-hat, 165-166 
kick drum, 161-162 
overheads, 163-164 
rack-toms, 164-165 
snare drum, 162-163 
tuned, 166 

Performance controllers, electronic 
instruments, 340 
Period, definition, 46 
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Periodic sampling, digital signáis, 
202 

Personal production, professional 
studio role, 9 
Personnel 

artist, 19-20 

assistant engineer, 20-21 
DJ, 21 

engineer, 20 

maintenance engineer, 21 
mastering engineer, 21 
overview, 18 
producer, 19-20 
studio musicians and arrangers, 
19-20 
VJ, 22 

Pews, absorption coefficients, 103 
Phantom power 

condenser mic, 117-118 
MIDI, 316-317 
ribbon mies, 115 
Phase 

room reflections, 98-99 
tutorial, 50 

waveform characteristics, 48-51 
Phase shift, definition, 52 
Piano 

frequeney, 485 
miking, 155-157 
Pickup 

bi- and nondirectional patterns, 
121 

and signal level, 60 
and working distance 
accent miking, 140 
ambient miking, 141 
cióse miking, 135-140 
distant miking, 133-135 
overview, 132-133 
Picture sync 

DAW software, 269 
real-world, 411 
Pinna, ftmetion, 68 
Pins, MIDI, 316 

Pitch Bend Change messages, MIDI 
specs, 328-329 
Pitch change 

effeets, 509-511 
groove tools, 292 
Pitch control 

DSP effeets, 276 

vinyl disc-cutting lathe, 592-594 
Pitch shifting, 509-510 
Pits, CD manufacturing, 577-578 
Planning, recording project tips, 
597-598 

Plaster, absorption coefficients, 103 
Píate effect, 508 


Plating, vinyl disc, 595-596 
Playback 

ATR curves, 190 
digital reproduction chain, 
211-212 
MIDI, 363-364 
stereo, soundfield, 465 
Playback head, magnetic tape, 188 
Plug-ins 

effeets, 476 

electronic instruments, 341-342 
equalization, 477-478 
example, 499 
mixing systems, 437, 438 
signal processing, 471 
Plywood 

absorption coefficients, 103 
floating floors, 85 
surface density, 82 
Polarity, speakers, 529-530 
Polar patterns 
definition, 119 
drum set miking, 161 
types, 120-122 

Polyphonic Key Pressures, MIDI 
specs, 328 

Polyvinyl chloride (PVC), magnetic 
tape media, 183 
Poniatoff, Alexander M., 614 
Pops 

removal, 522 
sound file editing, 265 
Portable production systems, 
benefits, 11 
Portable studio 

characteristics, 11-12 
recording systems, 227 
as studio type, 78-79 
truck example, 16 
Portastudio (Tascam), 227 
Post-production (post), audio-for- 
visual production 
environment, 75 

Power 

balanced, 456-457 
and decibel, 60-61 
related issues, 455-457 
Power amplifiers, function, 

420-421 

Power conditioning, 457 
Powercore DSP processor, 273-275 
Powercore FireWire PCle DSP 
processor, 280 

Powerplay Pro-8 headphones, 543 
PPMs, see Peak program meters 
(PPMs) 

Practice, recording project, 600-601 
Practice rundown, songs, 30 


Preamplifiers (preamp) 

dynamic range metering, 489 
examples, 219 
function, 418 
mies, 130-132 
monitor level control, 537 
Precisión Limiter, 274 
Preparation stage 

mastering, 566-567 
recording project tips, 597-598, 
600-603 

PreSonus AudioBox USB, 249 
PreSonus TubePre, 131 
Pressing, vinyl disc, 595-596 
Pressure-zone bass trap, function, 
105, 107 

Primacoustic™ Polyfuser, 106 
Primacoustic™ Razorblade quadratic 
diffuser, 100-101 
Printed ribbon microphone, 115 
Print-through, ATRs, 194-195 
Producer 

as personnel, 19-20 
recording process, 29-30 
recording project tips, 598-599, 
601 

Producers & Engineers (Recording 
Academy), 197 

Production environments, and 
MIDI, 310-312 

Production studio, all-digital, 38 
Product manufacturing 

CD/DVD handling, 588-589 

CD-R/RW, 584-586 

CDs 

burning software, 579-584 
DI Y, 586-587 
process, 578-579 
technical aspeets, 577-578 
facility and manufacturer, 
575-577 

recording process, 36 
vinyl disc 

cutting head, 591-593 
disc cutting, 591-594 
overview, 590 
pitch control, 593-594 
plating and pressing, 595-596 
recording dises, 594 
for Web, 596 

Professional demeanor, 611 
Professional music studio, 
characteristics, 74 
Program change messages, MIDI 
specs, 328 
Project studio 

characteristics, 10-11 
flexibility, 13-14 
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home-based, 9 
and knowledge, 12-13 
as legal business, 625-626 
self-subsidization, 14 
as studio type, 76-78 
Pro Tools HD 
edit screen, 360 

example, 236, 251, 260, 272, 570 
4-band EQII, 477-478 
Mod Delay II, 503 
M Powered on-screen mixer, 

270, 460 

virtual mixer example, 462 
virtual mixer strip layout, 433 
Proximity effect 

definition, 124-125 
drum set miking, 161 
tutorial, 125 
voice miking, 168 
Psychoacoustics 
definition, 64 
enhancements, 508-509 
Pulse-code modulation (PCM) 
digital recording, 211 
format, 373 

uncompressed sound file 
formats, 375 
Punch-ins 

ATRs, 192-193 
definition, 31 
MIDI recording, 357 
sound files, 266 
tutorial, 358 
Punch-outs 

MIDI recording, 357 
tutorial, 358 
Putnam, Bill, 4, 614 
PVC, see Polyvinyl chloride (PVC) 
PZM-6D boundary microphone, 

135 

9 

Q, see Charge (Q); Quality 
factor (Q) 

QSC Audio RMX 1850HD 

professional power amplifier, 
421 

Quadraphonic sound (quad), 
535-536, 546 

Quality factor (Q), 478-479 
Quantization 

digital signáis, 203-205 
MIDI editing, 361 
Quarter-frame messages, MIDI 
timecode, 406 

Quarter-wavelength bass trap 
example, 106 
function, 104-105 


Quick Sound Field, 108 
Quicktime (Apple), 378, 557 

R 

R, see Resistors (R) 

Rack-toms, 164-165, 485 
Radial JDI passive direct box, 147 
Radiado n 

cello patterns, 56 
obstacle effects, 48 
Radio-frequency interference (RFI) 
and MIDI cable, 316 
power-related issues, 455 
Rain Recording FiveBook™, 242 
Rain Recording StormDrive, 239 
RAM (random access memory) 
for DAW, 282-283 
digital delay, 503 
laptops, 244 
samplers, 225, 343-344 
Rane GE 130 graphic equalizer, 483 
Rapture instrument plug-in, 304 
Rare-earth magnet, ribbon mies, 

114 

Rarefaction, 42-43 
Ráster graphics, 383 
Raven's Gate, 328 
Read mode, 463 
RealAudio, 379-380 
Real-Time Audio Suite (RTAS), 277, 
471 

Real-time DSP, 271 
Real-time messages, 404-405 
Real-world synchronization, 
timecodes 

audio recorders, 409-410 
Computer routing, 411-413 
DAWs, 410-411 
master/slave relationship, 409 
overview, 408 
software, 410 
VCRs, 410 

video/picture sync, 411 
Reamping process, 147-148 
Reason Mixer 14:2 input strip, 435 
Reason NN-19 sample modules, 

345 

Reason NN-XT sample modules, 
301, 345 

Reason software, 299-302, 345 
Reason SubTractor polyphonic 
synth module, 301 
Record head function, 187-189 
The Recording Academy, 624 
Recording consoles, see Mixing 
Systems 

Recording direct techniques, 
146-147 


Recording Industry Association of 
America (RIAA), 390 
Recording process 
compression, 495 
marketing and sales, 36-37 
mastering, 34-35 
MIDI session 
click track, 356 
múltiple tracks, 357 
overview, 354-355 
step time entry, 358 
tempo, 355-356 
tutorial, 358 
mixdown, 32-34, 429 
monitoring, 428 
overdubbing, 31-32, 428-429 
preparation, 26 
product manufacturing, 36 
recording phase, 27-31, 

425-428 

song sequence editing, 35-36 
sound files, 259-261 
stages, 26, 424-425 
Recording project tips 

backups and archives, 606-607 
equipment needs, 607-608 
hardware protection, 610-611 
investment protection, 610 
meetings and preparation, 
601-603 

mixdown, 605-606 
musician professionalism, 608 
music lawyer, 599 
practice concerts, 609-610 
preparation, 600-601 
preparation and details, 597-598 
producer role, 598-599 
professional demeanor, 611 
recording, 604-605 
session documentation, 603-604 
set-up, 603 
software updates, 611 
tutorial, 612 
Recording studio 
acoustic isolation 
ceilings, 87 
floors, 84-86 
iso-booths, 90-91 
iso-rooms, 89-90 
overview, 80-81 
risers, 86-87 
walls, 81-84 

Windows and doors, 87-89 
acoustic partitions, 92 
acoustics overview, 79-80 
all-digital, 38 
characteristics, 4-7 
equipment needs, 607-608 
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example, 2-4, 6, 15 
floor plans, 5 
management, 22 
portable, 11-12 
professional role, 9-10 
Record Plant, 616 
Red Book CD, 372, 580 
Reflection 
acoustic, 95 

control room design, 107-110 
and room design, 97-99 
sound, 46-47 
Release 

compressor, 494 
definition, 57 
noise gate, 501-502 
Resistors (R), condenser mic, 117 
Resource Information File Format 
(RIFF), 375 

Retirement plan, tax tips, 630 
Reverberation 

and acoustic space, 73 
characteristics, 506-507 
control room design, 107-110 
definition, 70-71 
example, 69 
tutorial, 508 
types, 507-508 
Reverberation time (RT) 
definition, 71 
equation, 108 
Reverse effect, 508 
ReWire, 277-278, 302, 341 
RFI, see Radio-frequency interference 
(RFI) 

Rhythm tracks, 425-426 
RIAA (Recording Industry 

Association of America), 390 
Ribbon microphone 

Beyerdynamic MI60, 172-173 
design, 113-115 
mic selection, 149 
Royer labs R-121, 174, 176 
transient response, 125 
RIFF (Resource Information File 
Format), 375 
Risers, 86-87, 161 
rms, see Root-mean-square (rms) 
valué 

Rocking, analog sequence editing, 

36 

Rock Ridge format, 372 
Romeo format, 372 
Room effect (reverb), 508 
Rooms 

control, see Control room 
"Uve," 99, 101, 137 
monitoring, 523-525 


Root-mean-square (rms) valué, 
43-44 

Rotary head technology, 230 
Routing 

inline, 472-475 
parallel, 475 

timecode with Computer, 
411-413 

Royer labs R-121 ribbon 

microphone, 174, 176 
RT, see Reverberation time (RT) 
RTAS, see Real-Time Audio Suite 
(RTAS) 

Rubberband control 
DAW software, 280 
virtual mixers, 463-464 
Rumble 

floors, 84, 433 
LFE channel, 532 
mic characteristics, 123 
Running order, 568 

s 

Sadie PCM2 Editing System, 573 
Safety switch, ATRs, 187 
Sales, recording process, 36-37 
Sample-and-hold (S/H) circuitry, 
205, 208, 209-210 
Sample libraries, DAWs, 249 
Sample rate 

digital audio, 212-213 
digital signáis, 202 
mastering process, 572 
sound files, 257 
Samplers 

electronic keyboards, 157-158, 
342, 343-346, 353 
function, 225-226 
history, 614 

instrument plug-ins, 341 
laptops, 244 
MIDI production, 311 
project studio, 11 
Sampling process 

digital audio, 201-203 
oversampling, 206-207 
quantization, 203, 205 
Sand 

acoustic properties, 81, 82 
surface density, 82 
Saturation, amplification process, 
416 

Saxophone, miking, 170-171 
SCMS (Serial Copy Management 
System), 217 

Score mixdown, audio-for-visual 
production environment, 

75 


Seo ring 

audio-for-visual production 
enviro nment, 75 
MIDI, 267-269 
SDDS format, 373 
.sd format, 255 
.sd2 format, 255 
SDMI (Secure Digital Music 
Initiative), 379, 390 
Second harmonic, example, 53 
Secure Digital Music Initiative 
(SDMI), 379, 390 
Selectable frequeney equalizer, 482 
Self-employment tax, 628-629 
Sensitivity rating, microphones, 
125-126 

Separation, grand piano miking, 156 
Separation loss, ATRs, 196 
Sequencers, see MIDI sequencers 
Sequencing 
basics, 354 

mastering process, 567-568 
and MIDI, 352-354 
MIDI editing, 267-269 
song editing, 35-36 
Serial Copy Management System 
(SCMS), 217 

Serial data transmission, MIDI 
message, 323 

Session documentation, 287-289, 
603-604 

785 Records & Publishing, 11 
S/H, see Sample-and-hold (S/H) 
circuitry 

Sheffield mobile recording truck, 16 
Shelving filters, 480 
Sheppard, Tom, 10 
Shure Beta 58A dynamic mic, 113 
Shure SM57 microphone, 171-172, 
173 

Sibilance, voice miking, 168 
Side-chain routing, characteristics, 
475 

Signal chain (path), mixing systems, 
429-430 

Signal distribution, digital audio 
chain, 220-221 
Signal level 

equation, 60-61 
pickup distance, 60 
Signal paths, effeets processing, 
472-475 

Signal processing, see also Digital 
signal processing (DSP) 
control rooms, 9 
device overview, 469-471 
dynamic range, see Dynamic range 
effeets, see Effeets processing 
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limiting, 498-500 
multiple-effects devices, 511 
plug-ins, 471 
time-based effects 
delay, 503-506 
pitch shifting, 509 
psychoacoustics, 508-509 
reverb, 506-508 
time/pitch changes, 509-511 
Signal-to-error ratio, digital signáis, 
207-208 

Signal-to-noise (S/N) limitation, 
513-514 

Simple waves, examples, 54 
Single-ended noise-reduction 
process 

noise gate, 517-518 
overview, 516-517 
16-bit 

quantization, 203-204 
sound files, 214 
Skywalker Sound, 6,77 
Slap back, 71 
Slap echo, 71, 99 
Slap style, 154 
Slip time, 362 
Slope, filters, 480-481 
Slope ratio, compressor, 493 
S5-2L "Sorber" baffle, 93 
Small-fader monitor modes, 440, 
442 

SMPTE timecode 

frame rate standards, 398-399 
LTC and VITC, 399-400 
MTC conversión, 406 
real-world sync 
audio recorders, 409-410 
Computer routing, 411-413 
DAWs, 410-411 
master/slave relationship, 409 
overview, 408 
software, 410 
VCRs, 410 

video and picture sync, 411 
refreshment, 400-401 
and sync 

offset times, 402-403 
overview, 394-395, 401-402 
recording levels, 403 
signal distribution, 403 
timecode word, 395-396 
tutorial, 396 
usage guidelines, 413 
SMPTE timecode word 

biphase representation, 397 
definition, 395-396 
S/N, see Signal-to-noise (S/N) 
limitation 


Snare drum 

compression, 496-497 
frequency, 485 
miking, 162-163 
waveform envelope, 57 
.snd format, 255 
Society of Professional Audio 

Recording Services (SPARS), 
624 

Soffit, and acoustic isolation, 83-84 
Software, general 
advances, 614-615 
CD burning, 579-584 
CD manufacturing, 578-579 
DAW 

accelerator cards, 278-279 
basic function, 257-259 
comping, 267 
DSP effects, 271 
DSP plug-ins, 276-277 
MIDI sequencing and scoring, 
267-269 

mixdown and effects 
automation, 279-281 
mixdown exporting, 281 
mixing, 269-279 
ReWire, 277-278 
sound file copy & paste, 263 
sound file recording, 262 
sound files, 259-267 
sound fixing, 265-267 
video and picture sync, 269 
and device drivers, 371 
instrument plug-ins, 341-342 
and MIDI, 312 
MIDI data, 339 
mixing systems, 430-432 
music printing, 365-367 
real-world sync, 410 
sequencers, 353-354 
updating, 611 

Solid State Logic Duality consolé, 
424 

Solid State Logic XL 9000 K, 431, 
440, 441, 444, 447 
Solo function, channel fader, 444 
Sonar 7 Producer edition, 258 
Song Position Pointer (SPP) 
messages, 331 

Songs 

practice rundown, 30 
sequence editing, 35-36 
Song Select messages, 331 
Sony ACID Pro, 297 
Sony DMX-R100 digital consolé, 459 
Sony MDR-7506 stereo headphones, 
542 

Sony MiniDisc, 229-230 


Sony/Phillips Digital Interface 
(S/PDIF), 216-217 
Sony/Tree Music Studio, 5 
Sound, general 

characteristics, 41-42 
diffraction, 47 
obstacle effects, 48 
reflection, 46-47 
ribbon mic, 113 
waveforms, see Waveforms 
Soundboard, piano miking, 155, 

157 

Soundboard holes, piano miking, 
155 

Sound Designer I format (.sd), 255 
Sound Designer II format (.sd2), 

255 

Soundfield 

examples, 69-70 
monitoring overview, 530 
stereo playback, 465 
Sound files 

bit rates, 213-214 
compressed codee formats, 375 
DAW software, 259-267 
copy & paste, 263 
fixing, 265-267 
recording, 262 
DAW systems, 249 
formats, 255-257 
pitch shifting, 509 
recording tutorial, 262 
sample rates, 212-213 
uncompressed formats, 375 
Sound hole, acoustic guitar miking, 
152-153 

Sound lock, example, 89 
Sound-pressure (SPL) levels 
auditory perception, 66 
and decibel, 59-60 
measurement, 58 
monitor volume, 531-532 
overload distortion, 127 
threshold of feeling, 63 
threshold of hearing, 62 
threshold of pain, 63 
Sound-pressure waves, definition, 41 
SoundSecure™ studio door, 
example, 89 
Sound sources 

distant miking, 133 
isolation, 27 

mic directional response, 120 
"Sound sucker," studio design, 109 
Sound waves 

condenser mies, 117 
mic directional response, 120 
SpaceArray sound diffusers, 100 
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Spaced microphones 

acoustic guitar miking, 153 
overhead miking, 163 
techniques, 142 
Space perception, 69-70 
SPARS, see Society of Professional 
Audio Recording Services 
(SPARS) 

Spatial localization 
definition, 67 
and psychoacoustics, 509 
Spatial positioning, in mixing, 463 
S/PDIF protocol, 216-217 
Speaker design 

active and passive, 526-528 
crossover networks, 526-527 
overview, 525-526 
Speakers, see also Monitoring 
control room design, 96 
and monitoring, 523-525 
placement, 549-550 
polarity, 529-530 
in studio, 544 
Speaker types 
for cars, 543 

farfield monitoring, 538-539 
headphones, 541-542 
nearfield monitoring, 539-540 
small, 541 

Spectral analyzer, 538 

Spectral reference, monitoring, 538 

Speed 

DAW characteristics, 237 
magnetic tape, 186, 193-194 
SPL, see Sound-pressure (SPL) levels 
SPP, see Song Position Pointer (SPP) 
messages 
Spring effect, 508 
Squashed, 492-493 
SRS Circle Surround II, 557 
SRS format, 557 

SRS TruSurround XT process, 557 
SSL Duende mini DSP accelerator 
plug-in, 472 
Stage effect, 508 
Standard MIDI files, 382 
Standing waves 
definition, 98 
room reflections, 99 
Start command, MIDI real-time 
messages, 405 
Status byte, 324 
Steel, surface density, 82 
Steinberg Cubase 4 Media 

Production System, 236, 261 
Steinberg Cubase/Nuendo 
automation modes, 462 
DeNoiser Noise app, 518 


EQ plug-in, 272, 477 
notation edit window, 268 
Notepad apps, 288 
piano roll edit window, 268 
punch-in/out Controls, 357 
seo re window, 367 
Sync Preferences, 412 
virtual mixer, 441 
Steinberg Cubase SX audio 

production software, 354 
Steinberg Cubase SX/Nuendo virtual 
mixer, 434 

Steinberg DoubleDelay VST plug-in, 
275 

Steinberg Groove Agent 3 drum 
plug-in, 305 

Steinberg HALion 3 software 
sampling plug-in, 344 
Steinberg Nuendo 4, 259 
Steinberg Nuendo Control Room 
Mixer, 548 

Steinberg Nuendo Dolby Digital 
Encoder plug-in, 555 
Steinberg Nuendo mixer, 560 
Steinberg Nuendo on-screen mixer, 
270 

Steinberg Xphraze synth plug-in, 

343 

Step time, MIDI recording, 358 
Stereo cutting head, vinyl dises, 
592-593 

Stereo miking techniques 
Decca tree, 145 
M/S, 143-145 
overview, 142 
reamping in mix, 147-148 
spaced pair, 142 
Telefunken M216, 179 
X/Y, 142-143 

Stereo monitoring, characteristics, 
534 

Stereo recording, groove motion, 

592 

Stereo sound 
with sub, 535 
televisión, 547 
Stick-on labels, CDs, 588 
Stopband, 480 

Stop command, MIDI real-time 
messages, 405 
Storage 

CD-R/RW quality, 585 
magnetic tapes, 195 
Streaming data, file sizes, 388 
Stringed instruments 
frequeney, 485 
miking, 166-167 
piano miking, 155 


Studer A827 analog multitrack 
recorder, 185 

Studio, see Portable studio; 

Project studio; 

Recording studio 
Studio design 

absorption, 102-104 
acoustic echo chambers, 

109-110 

frequeney balance, 97-102 
history and considerations, 6-7 
room reflections and reverb, 
107-110 

symmetry, 92-97 
Studio-in-a-box, 615 
Studio managment, tasks, 22 
Studio monitors 

and acoustic isolation, 84 
headphones, 543-544 
speakers, 544 

Studio musicians, as personnel, 19 
Studio Records, LLC, 90 
Studio track log, example, 29 
Studio types 

audio-for-visual production 
environment, 75-76 
portable, 78-79 
professional music studio, 74 
project studio, 76-78 
Stylus RMX real-time groove 
module, 303 

Subcodes, CD manufacturing, 578 
Subwoofer 

monitoring, 532-533 
with stereo, 535 
Summing amplifiers, function, 
419-420 

Supercardioid polar patterns, mic 
characteristics, 120-122 
Surface density, building materials, 
82 

Surround Decca tree, techniques, 146 
Surround miking techniques 
acoustic guitar, 153 
ambient surround mies, 145 
Decca tree, 146 
recording direct, 146-147 
Surround sound 

active/passive monitors, 

551-552 

back catalog reissues, 561 
Dolby Digital (AC3), 553-555 
Dolby Pro Logic, 555-557 
future, 547-548 
history, 546-547 
interfacing, 552-553 
with LFE, 536-537 
mixing, 559-561 
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MP4, 557, 559 
overview, 545-546 
SRS format, 557 
up-mix to 5.1, 559 
without LFE, 535 
WMA, 557 

Surround-sound panners, 560 

Sustain, definition, 57 

Sweetwater Control Room, 8 

Synchronization (sync) 

ATRs, 190-192 
definition, 393 
jam sync, 401 
media devices 

digital audio timing reference, 
408 

LTC and VITC timecode, 
399-400 

overview, 393-394 
SMPTE timecode, 396, 401-403 
sync information data, 396 
timecode frame standards, 
398-399 

timecode overview, 394-395 
timecode refreshment/jam 
sync, 400-401 
timecode word, 395-396 
video timing reference, 407 
MIDI-based, 404-406 
MIDI data, 339 
real-world, 408-413 
video, 269 

Synthesizers (synth) 

characteristics, 342-343 
compression, 497 
FM synthesis, 342, 382 
general MIDI, 383 
MIDI channels, 325 
MIDI system, 320 
miking, 157 
noise gate, 502 
patch data, 336 
project studio, 11 

Sys-Ex (System-exclusive messages), 
322, 335-337 

System-Common messages, MIDI 
specs, 331 

System-exclusive messages (Sys-Ex), 
322, 335-337 

System messages, MIDI specs, 331, 
335 

System Real-Time messages, MIDI 
specs, 331 

T 

Tagged metadata, multimedia, 
380-381 

Tails-out position, ATRs, 195 


Tannoy System 215 DMTII main 
reference monitor, 539 
Tape machines, control room, 8 
Tascam DA-98HR modular digital 
multitrack recorder, 233 
Tascam DA-P1 portable DAT 
recorder, 232 

Tascam Digital Interface (TDIF), 
219-220 

Tascam DP-02CF compact flash 
digital system, 228 
Tascam FW-1884, 253 
Tascam Portastudio, 227 
Tascam X-48 multitrack hard-disk 
recorder, 227 
Tax tips 

client entertainment, 629 
deductions, 626-627 
end-of-year, 630 
gear deductions, 627-628 
health insurance, 629 
home office, 628 
IRS help, 630 

legal business steps, 625-626 
retirement plan, 630 
self-employment tax, 628-629 
vehicle deductions, 629 
te electronic Master X3 multiband 
dynamic plug-in processor, 
571 

te electronic Powercore DSP 

processor, 273-275, 280 
te electronic System 6000 digital 
effeets processor, 507 
te electronic UnWrap PowerCore 
Plug-in, 559 

TDIF (Tascam Digital Interface), 
219-220 

TDM (Time Domain Multiplex), 

277, 471 

Telefunken M216 stereo 
microphone, 179 
Televisión, stereo sound, 547 
Tempo, MIDI session, 355-356 
Temporal fusión, definition, 70 
Tensión, magnetic tape, 186 
Terrazzo, absorption coefficients, 103 
T'Fusor™ sound diffusers, 100 
Theater sound, with LFE, 537 
Theory of electromagnetic 

induction, definition, 112 
Third harmonic, example, 53 
3:1 distance rule 
definition, 137 
example, 139 
Thresholds 

compressor, 493 
ear pain, 63 


feeling, 63 
hearing, 62 
Thru jack, MIDI, 319 
Tie-straps, 607 
Timbre 

cióse miking, 139 
definition, 56 

Time, and tape length, 186 
Time-based effeets 
delay, 503-505 
delay tutorial, 506 
pitch shifting, 509 
psychoacoustics, 508-509 
reverb, 506-508 
time/pitch changes, 509-511 
Time-base errors, example, 221 
Time change 

DSP effeets, 276 
effeets, 509-511 
groove tools, 292 

Timecode, see MIDI timecode (MTC); 

SMPTE timecode 
Time Domain Multiplex (TDM), 

277, 471 

Timing dock, MIDI real-time 
messages, 404 
Timing reference 
digital audio, 408 
video signal, 407 

Tip-ring-sleeve (TRS), 434, 436, 450 
Tip-sleeve, 450 

TL (transmission loss), 81-84 
Todd-AO, 546 
Toms, 164-165, 485 
Tonal balance, see Timbre 
Tones 

and auditory perception, 64-65 
combination, 66 
Total transport logic (TTL), 
advantages, 186-187 
Track-at-once writing, characteristics, 
584 

Tracks 

inline effect, 474 
and monitoring, 443 
recording process, 425-426 
Transducers 

digital terminology, 200 
ear as, 61 
function, 37-39 
mic as, 111 

mic selection, 172-173, 175-176 
mic types, 112 
MIDI drums, 351 
Transient response 

drum set miking, 161 
mic characteristics, 125 
mic examples, 125 
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Transistor, amplification process, 

415- 417 

Transitions, mastering sequencing, 
568 

Transmission loss (TL), 81-84 
Transport, magnetic tape, 184-187 
Transposition, MIDI editing, 361 
Trombone, 57, 151 
TRS (tip-ring-sleeve), 434, 436, 450 
True Blumlein shuffling, 144 
Trumpet, mic placement, 150-151 
Tuba, mic placement, 151 
Tubes, mic preamps, 131 
Tumbas, 166 

Tune Request messages, 331 
Tuning, room considerations, 524 
Turno ver, 480 

Tutorials, see Do It Yourself (DIY) 
tutorials 

20-bit, sound files, 214 
24-bit, sound files, 214 
Type 0 format, MIDI files, 359 
Type 1 format, MIDI files, 359 

u 

UAD-2 DSP PCle DSP processor, 
279 

U-Boat™ floor beam, example, 85 
llltrapatch PX2000 patch bay, 450 
Unbalanced Unes, mic 

characteristics, 130 
Uncompressed sound file formats, 
375 

Uninterruptible power supply 
(UPS), 457, 610-611 
Universal Audio 1176N amplifier, 
492 

Universal Audio UAD Precisión 
Multiband plug-in, 498 
Universal Recording, 614 
Up-mix, surround sound, 559 
Upper partials, definition, 52 
Upright piano, miking, 156-157 
UPS, see Uninterruptible power 
supply (UPS) 

USB interface, 228, 245-246, 338 
User-friendly operaiton, DAW 
characteristics, 237 

v 

V, see Voltage (V) 

Vacuum tube, amplification process, 

416- 417 

Valve, see Amplifiers (amp) 

Variable bit rate (VBR), 377 
VBR, see Variable bit rate (VBR) 

VCA (voltage-controlled amplifier), 
421-422 


VCR (video cassette recorder), 410, 
547 

Vector graphics, 383, 386 
Vehicle expenses, tax tips, 629 
Velero™, 607 

Velocity, waveform characteristics, 

46 

Velour, absorption coefficients, 103 
Vertical interval timecode (VITC), 
399-400 

Vibraphone, miking, 166 
Vibrating mass, air effeets, 43 
Video 

audio for, 17 
DVD formats, 373 
real-world sync, 411 
Video cassette recorder (VCR), 410, 
547 

Video games, surround sound, 548 
Video jockey (VJ), as personnel, 22 
Video signal, timing reference, 407 
Video sync, 269, 411 
Vinyl disc manufacture 
cutting head, 591-593 
disc cutting, 591-594 
mastering process, 595 
overview, 590 
pitch control, 593-594 
plating and pressing, 595-596 
recording dises, 594 
Vinylite, vinyl disc manufacture, 595 
Viola, 54-55, 167 
Violin, 54-55, 167 
Virtual e-dub, 391 
Virtual effeets, and hardware, 
476-478 
Virtual mixers 

drawn automation, 463 
mixdown and effeets 
automation, 460-461 
read mode, 463 
technology overview, 457-460 
write mode, 461-463 
Virtual pot (V-pot), 458 
Virtual Studio Technology (VST), 
277 

VITC, see Vertical interval timecode 
(VITC) 

VJ, see Video jockey (VJ) 

Vocal booths, 90-91 
Vocals 

compression, 497 
frequeney, 485 
miking, 112, 149 
Voice 

miking, 168-169 
mixing and balancing, 465 
samplers, 346 


Voice circuitry, electronic 

instruments, 340-341 
Voice coil, dynamic mies, 112-113 
Voltage (V) 

and amplification, 416-417 
and charge, 115 
and decibel, 60 
and power conditioning, 457 
and sound waves, 117 
Voltage-controlled amplifier (VCA), 
function, 421-422 
Volume 

mastering process, 568-569 
monitors, 531-532 
V-pot, see Virtual pot (V-pot) 

VST (Virtual Studio Technology), 
277 

VU metering 

dynamic range, 489-490 
mixing systems, 451-453 

w 

"Wahing" effect, 171 
Wallace, Roy, 145 
Walls 

and acoustics, 81-84, 86, 89-90, 
94, 98-99, 104-105 
piano miking, 157 
pitch control, 594 
surround sound, 546-547 
Walters, Beth, 4 

Warping, groove tools, 292-293 
Wave (.wav) format, 255, 375 
Wave64 (.wav64) format, 255 
Waveforms 

amplifier function, 418 
amplitude, 43-45 
characteristics, 42-43 
distortion, 221 
envelope, 56-57 
frequeney, 45 
harmonic content, 52-56 
phase, 48-51 
phase shift, 52 
room reflections, 98-99 
sound file recording, 260-261 
velocity, 46 
wavelength, 46-48 
Wavelength 

frequeney relationship, 47 
frequeney response, 47-48 
sound diffraction, 47 
sound reflection, 46-47 
waveform characteristics, 46-48 
Wave movement, 42 
Wave propagation, 42 
Waves L1 Ultramaximizer plug-in, 
499 
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Waves SI Stereo Imager plug-in, 144 
Wavetable synthesis, 342 
WDM, see Windows Driver Model 
(WDM) 

Web, see World Wide Web (WWW) 
Wells, Frank, 18 

Westlake BBSM-15 Reference Series 
monitor, 527 
White Book CD, 372 
Windows Driver Model (WDM), 
251-252 

Window size, desktop video, 386 
Windows Media Audio (WMA), 
378-379, 557 

Windows XP, sound formats, 
557-558 

Wireless MIDI, 317-318 
Wiring 

MIDI cable, 315 
MIDI phantom power, 317 
The Wizard of Oz, 312 
WMA, see Windows Media Audio 
(WMA) 

Women, in industry, 23 
Women's Audio Mission, 23 
Wood 

absorption coefficients, 103 
surface density, 82 
Woodwind instruments, miking, 
169-170 

Working distance, and pickup 
characteristics 


ambient miking, 141 
cióse miking, 135-140 
distant miking, 133-135 
overview, 132-133 
Worldclock, digital recording chain, 
222-224 

World Wide Web (WWW) 
audio distribution, 10 
basic operation, 373-374 
connection speeds, 387 
continuing education, 623 
copyright protection, 390 
DAW connection, 247 
as delivery media, 373-374 
DIY suggestions, 3 
impact, 619-620 
instrument samples, 346-347 
and multimedia 
copyright protection, 390 
definition, 369 
delivery formats, 374-375 
desktop video, 386-387 
as environment, 369-371 
general MIDI, 382-383 
graphics imaging, 383, 386 
Internet radio, 391 
and Internet speed, 387-388 
MIDI, 381-382 
perceptual coding, 375, 
377-380 

secure digital music, 390-391 
speed, 387-388 


standard MIDI files, 382 
tagged metadata, 380-381 
virtual e-dub, 391 
web distribution, 388-390 
product distribution, 388-390 
product manufacture for, 596 
Write mode, digital consoles, 
461-463 

x 

Xlogic SuperAnalogue mic amp, 131 
Xylophone, miking, 166 
X/Y stereo miking 

acoustic guitar, 152, 153 
overhead miking, 163 
techniques, 142-143 

Y 

Yamaha DS6IV PRO Disclavier 
MIDI piano, 348 
Yamaha HS-80M studio monitor 
speakers, 540 

Yamaha MOTIF XS6 keyboard 
workstation, 225 
Yellow Book CD, 372 

z 

Z channels, example, 87 
Zone, samplers, 346 
Zoom H4 4-track portable recorder, 
229 


